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FOREWORD 

| 

\ 

This  is  part  one  of  a  final  report  on  work  sponsored  by  the  Aerospace  Research 
Laboratories,  Office  of  Aerospace  Research,  United  States  Air  Force,  under  Contract 
AF  33(616) -7572,  Project  7O71,  Task  7071-01,  The  technical  effort,  conducted  at 
Rocketdyne  in  the  Research  Department,  was  monitored  by  Dr.  H.  Leon  Harter  of  the 
Aerospace  Research  Laboratories . 

i 

The  initial  motivation  of  the  overall  research  program  was  to  develop  methods 
for  finding  optimum  experimental  designs,  taking  into  account  cost  as  well  as 
variance  properties.  Loss  functions  which  included  quadratic  and  linear  cost  terms 
were  used  for  designs  based  on  simple  regression  models.  Explicit  minimizations  of 
this  loss  function  were  obtained  in  certain  simple  cases  (e.g.,  strictly  linear 
multivariate  model)  and  some  interesting  general  nonoptimality  properties  of 
orthogonal  designs  were  discovered.  Treatment  of  more  general  cases,  however,  met 
with  excessive  computational  difficulty  and  for  this  reason  the  scope  of  the 
original  program  was  expanded  to  include  the  use  of  Bayesian  decision  theory  and 
finally  more  recently  the  application  of  the  methods  of  linear  programming.  Hew 
tasks  were  subsequently  added  in  the  areas  of  expansible  and  contractible  designs, 
multivariate  quanta!  response  problems,  and  most  recently  estimation  questions  in 
reliability  growth  models . 

The  research  on  loss  functions  has  been  described  in  detail  in  previous  ARL 
Technical.  Documentary  Reports  (ARL  62-373,  ARL  65-IO7)  and  no  further  mention  will 
be  made  of  it  here.  Results  obtained  in  the  other  investigations  during  the  past 
two  years  (ending  February,  1965)  are  reviewed  in  the  three  parts  of  this  report. 
Fart  two  deals  with  formulations  of  reliability  growth  models  and  statistical 


li 


estimation  of  parameters  of  the  stochastic  processes  resulting  from  such  models . 
Part  three  involves  the  design  and  analysis  of  sensitivity  experiments  in  which 
there  are  one  or  more  stimulus  variables . 

Portions  of  the  work  reported  in  part  one  (Appendices  A,  B,  and  E)  were  based 
on  dissertation  research  by  the  author  while  he  was  a  graduate  student  at  the 
University  of  Chicago.  The  author  acknowledges  the  assistance  of  Hr.  George 
Uglean  on  various  parts  of  this  research. 
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ABSTRACT 


This  report  describes  results  of  research  on  factorial  designs  during  a  two 
year  period  ending  in  February,  1965 .  These  include  (a)  characterizations  of 
orthogonal  and  two  classes  of  non-ocrth agonal  designs  as  solutions  co  linear  con¬ 
straints,  (b)  optimality  properties  of  orthogonal  designs,  (c)  development  of  a 
general  class  of  non-orthogonal  sequential  factorial  designs,  (d)  results  on 
certain  families  of  2n  designs,  and  (e)  description  of  a  special-purpose  linear 
programing  computer  routine  for  combinatorial  problems  in  experimental  design. 
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Results  obtained  along  two  lines  of  study  are  given  in  this  report.  The  first 
is  concerned  with  a  special  class  of  sequential  incomplete  factorial  designs,  termed 
"expansible  and  contractible" .  The  study  of  this  class  was  motivated  by  the  need 
for  designs  which  do  not  lose  their  desirable  properties  if  they  are  stopped  pre¬ 
maturely  or  if  new  factors  are  added  after  the  experiment  has  been  initiated. 

While  any  design  can  be  ordered  so  that  the  factors  are  Introduced  sequentially, 
not  all  designs  have  the  contractibility  property,  namely  that  truncated  designs 
retain  relatively  high  efficiency.  In  particular,  orthogonal  designs  do  not  usually 
provide  convenient  expansible  series  of  designs.  The  class  of  one -at-a~  time 
designs  is  a  convenient  expansible  series,  but  the  variance  efficiencies  of  such 
designs  are  poor.  The  general  class  of  permutation- invariant  designs  is  defined 
and  discussed  in  detail,  and  it  is  shown  that  many  expans ible  series  can  be  con¬ 
structed  within  this  class .  The  variances  and  efficiencies  for  many  resolution** 
five  2n  designs  are  given,  and  the  best  expansible  series  are  indicated. 

The  second  investigation  involves  the  use  of  linear  programming  for  finding 
incomplete  factorial  designs .  Details  of  the  simplex  method  are  reviewed,  and  a 
special-purpose  linear-programming  computer  routine  for  finding  factorial  designs 
is  described.  It  is  shown  how  the  special  features  of  the  problems,  such  as  the 
size  and  condition  of  the  basis  matrices  and  the  degeneracy  of  the  solution',,  are 
exploited  in  the  computer  routine.  It  has  been  vised  successfully  to  derive  several 
known  designs  which  involve  constraint  matrices  of  moderate  size.  On  larger  prob¬ 
lems,  computational  difficulties  and  non- integral  solutions  have  precluded  obtain¬ 
ing  new  designs . 


Manuscript  released  by  the  author  May  1965  for  publication  as  an 
ARL  Technical  Report. 
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This  report  has  as  appendices  five  papers  which  had  previously  been  issued 
only  as  internal  reports  at  Rocketdyne .  Appendix  A  deals  with  the  formulation  of 
combinatorial  problems  in  experimental  design  as  linear  programming  problems. 
Appendix  B  contains  a  survey  of  non -orthogonal  designs,  Introduces  the  classes  of 
permutation- invariant  and  cluopvise -orthogonal  designs,  and  giver  linear- 
programming  characterizations  of  these  classes .  Appendix  C  is  a  paper  on  optimality 
properties  of  orthogonal  designs  which  was  presented  at  the  Chicago  meeting  of  the 
American  Statistical  Association  in  December  of  1964.  Appendix  D  is  an  investiga¬ 
tion  of  the  class  of  one -at-a- time  designs .  Appendix  £  contains  .results  on  Incom¬ 
plete  factorial  designs  originally  reported  at  the  Urbana  meeting  of  the  Institute 
of  Mathematical  Statistics  ir  November  of  1961. 
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EXPANSIBLE  AID  CCKERACTIBI£  EBSID® 


OTBODUCTIOH 

If  an  ordinary  incoagletc  factorial  design  la  baited  prematurely,  most  of  the 
information  is  loot.  In  industrial  research  and  development  progress,  there  are 
■any  reasons  why  a  planned  experiment  may  be  stopped,  including  budgetary  changes, 
failure  of  test  equipment,  pressure  for  results,  or  a  shift  in  experimental 
emphasis  as  it  becomes  obvious  that  a  factor  should  be  included  or  excluded*  The 
uncertainty  of  the  future  of  a  program  often  leads  an  experimenter  to  resist  the 

apparent  rigidity  of  a  formal  statistical  test  plan.  The  concept  of  designs  which 

r 

introduce  the  factors  sequentially,  which  we  have  chosen  to  call  expansible  and 
contractible  designs,  has  been  developed  to  correct  this  deficiency  in  the  designs 
currently  available  in  the  literature  * 

i  A  cootractltle  18  “  or*ered  ,pecmc‘tlon °f  cc"bl“ti0“B 

I  at  which  runs  are  to  be  made  which  has  the  property  that  if  the  experiment  is 

i 

halted  prematurely,  quantitative  Inferences  about  the  effects  of  some  of  the 
|  factors  can  still  be  made .  Contract ib ility  can  be  achieved  by  varying  only  a  few 

j 

j  of  the  factors,  preferably  those  thought  beforehand  to  have  the  largest  effects, 
in  the  early  part  of  the  experiment.  If  the  factors  which  are  held  fixed  do  not 
interact  with  those  which  are  varied,  then  the  effects  of  the  factors  which  have 
been  varied  can  be  estimated  without  bias  *  The  unestimated  effects  are,  however, 
aliased  with  the  estimates  of  the  mean. 

Any  design  is  expansible  since  any  design  can  be  repeated  with  .a  previously 
fixed  factor  at  another  level.  It  is  useful  to  apply  the  term,  however,  to  series 
j  of  designs  any  one  of  which  can  he  contracted  to  the  next  lower  or  expanded  to  the 
§  next  higher  design  in  the  series . 


As  example  of  these  ideas  ve  consider  a  simple  expansible  and  contractible 
series  for  up  to  three  two-level  factors .  For  one  factor  the  design  is  simply 

0 

where  0  represents  the  low  level  and  1  the  high  level  of  the  single  factor. 
Now  suppose  a  second  factor,  which  was  at  its  low  level  during  the  first  two  runs, 
is  varied.  The  design  is 

0  0 
10 

0  1  , 

of  which  the  first  two  runs  have  already  been  done .  A  third  factor  (at  its  low 
level  for  the  first  three  runs)  may  now  be  added  in  two  additional  runs,  so  that 
the  complete  design  Is 

0  0  0 
10  0 
0  10 
0  0  1 
111 

In  unpublished  work  this  design  has  been  shown  by  the  methods  of  Appendix  E  to  be 
the  five-run  design  for  which  the  variance  of  each  estimate  is  minimized.  After 
the  first  two  tests,  note  that  the  estimate  of  the  mean  is  aliased  with  the  main 
effects  of  the  second  and  third  factors,  but  the  estimate  of  the  main  effect  of 
the  first  factor  is  not  aliased.  Similarly,  after  three  runs  the  main  effects  of 
the  first  and  second  factors  can  be  estimated  while  the  effect  of  the  third  is 
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aliased  with  the  mean*  After  four  runs  the  effects  of  all  three  factors  can  be 
estimated,  but  the  addition  of  the  fifth  test  Improves  the  efficiency  of  the  esti¬ 
mates  considerably*  Note  that  the  last  four  tests  comprise  a  half -replicate  of  a 
2  ,  which  is  the  standard  design  to  use  for  three  noninteractive  factors*  This 
half-replicate  is  not  contractible,  however,  and  no  estimates  can  be  calculated 
until  after  all  four  responses  have  been  observed* 

Belov  is  described  the  general  theory  applicable  to  expansible  and  contracti¬ 
ble  designs  for  2n  experiments*  The  use  of  expansible  series  of  orthogonal  designs 

/ 

is  discussed  as  veil  as  one-at-a-tlme  designs*  Finally,  the  properties  of  a 
general  class  of  such  designs,  permutation- invariant  designs  of  resolution  5,  are 
derived  In  detail* 

EXPANSIBLE  AN D  CCfTERACTIBLE  LESIONS  FOR  WO -LEVEL  FACTORS 
General  Theory 

The  treatment  combinations  of  any  incomplete  2n  design  can  be  arranged  in 
such  an  order  that  the  factors  are  introduced  sequentially.  Here  and  in  the 
sequel,  a  factor  is  considered  to  be  introduced  at  that  run  in  which  the  factor 
first  appears  at  its  high  level*  If  one  considers  the  treatment  combinations  as  n-blt 
binary  numbers,  an  arrangement  with  this  property  may  be  obtained  by  placing  the 
treatment  combinations  in  increasing  numerical  order*  If  the  factors  are  arranged 
in  order  of  increasing  a  priori  Importance  (increasing  magnitude  of  effect),  then 
the  order  of  introduction  is  the  inverse  of  the  order  of  Importance*  A  design 
ordered  in  this  way  may  or  may  not  be  contractible  depending  an  whether  or  not  the 
subdeaigns  are  singular*  For  example,  the  subdesigns  of  the  half -replicate  of  a 
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0  0  1 
0  10 
10  0 
111 

are  singular . 

For  2n  designs  it  can  be  shown  that  the  variance  of  each  estimate  is  mini- 
mixed  if  and  only  if  the  design  is  oxthogonal.  It  is  natural  to  inquire  therefore, 
whether  orthogonal  designs  are  contractible  and  expansible .  Since  in  an  orthog¬ 
onal  incomplete  2n  design  the  high  level  appears  in  half  the  tests,  an  orthogonal 
design  would  necessarily  be  contracted  to  half  its  number  of  runs  or  expanded  to 
twice  its  number  of  runs  by  the  deletion  or  addition  of  a  factor.  Therefore,  if 
there  is  pressure  to  hold  the  niaaber  of  tests  to  a  minimum,  it  will  usually  not  be 
useful  to  restrict  attention  to  orthogonal  designs . 

There  are  several  exceptions  to  the  general  rule  that  the  contractibllity 
property  is  not  satisfied  by  the  orthogonal  design  which  has  the  smallest  number 
of  runs  for  a  given  experiment.  The  only  exception  for  resolution  3  designs  (i.e 
those  for  estimating  only  main  effects)  is  the  trivial  case  of  two  factors,  for 
which  the  four-run  design  can  be  contracted  to  the  minimal  two-run  one- 

factor  design.  There  are  several  exceptions  for  resolution  3  designs  (those  for 
estimating  main  effects  and  two-factor  interactions)  •  These  cases  are:  expanding 
from  one  up  through  four  factors;  expanding  from  five  up  through  seven  factors; 
and  expanding  from  eight  to  nine  factors .  In  all  these  cases  the  smallest  orthog¬ 
onal  design  containing  1+1  factors  contains  tvice  as  many  runs  as  the  smallest 
for  1. 
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The  optimality  property  vhich  leads  one  to  select  orthogonal  designs  is 
based  on  variance  considerations .  Such  considerations  say  be  subordinate  to 
others  in  research  and  development  programs,  in  vhich  It  often  happens  that  the 
cost  per  test  is  large  and  the  error  of  an  individual  observation  is  smaller  in 
magnitude  than  the  effects  of  Interest.  In  the  extreme  case  vhere  the  error 
variance  is  zero,  the  only  relevant  criterion  for  a  design  is  its  "degree -of - 
freedom"  efficiency,  defined  as  the  ratio  of  the  n umber  of  parameters  to  the 
number  of  runs .  In  evaluating  2n  designs  one  nay  take  an  intermediate  point  of 
view  and  consider  the  efficacy  of  designs,  vhich  is  the  product  of  the  degree- 
of -freedom  efficiency  and  the  prediction  index,  defined  in  another  report  [  3  ] . 

The  prediction  index  for  2n  designs  reduces  to  the  reciprocal  of  the  product  of  the 
number  of  runs  and  the  average  variance  of  an  estimate .  The  has  its  maxi¬ 

mum  value  of  unity  for  orthogonal  (equal-frequency)  designs. 

One  more  concept  vhich  is  useful  in  developing  expansible  series  of  designs 
is  the  "guaranteed  minimum"  •  It  may  often  be  assimmd  that  the  experiment  vill  not 
be  stopped  before  a  certain  number  of  runs  have  been  made,  vhich  number  ve  refer 
to  as  the  guaranteed  minimum.  It  is  then  not  necessary  to  require  that  the  a*»«ign 
be  contractible  beyond  this  minimum.  If  a  design  has  no  guaranteed  asso¬ 

ciated  vith  it,  then  It  in  said  to  be  fully  contractible. 

One-At-A-Tlme  Design* 

A  2n  design  containing  n+l  runs  in  vhich  the  i-th  factor  is  introduced  at 
the  (l+l)-st  run  is  called  a  one-at-a-tlme  design.  Such  designs  are  of  resolution 
3,  are  fully  contractible,  and  are  perfectly  efficient  in  the  degree -of -freedom 
sense.  Their  variance  properties  are,  however,  knows  to  be  poor.  In  exploratory 

^Thie  term  and  the  concept  were  suggested  to  the  author  by  Cuthbert  Daniel. 
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research  vlth  lew  error  their  degree -of- freedom  efficiency  and  contract  ibllity  nay 
still  lake  then  useful  designs .  In  another  paper,  which  is  included  as  Appendix  D, 
it  was  shown  that  fo2  is  a  lover  hound  for  the  variances  of  the  estimates 
obtained  from  such  designs,  and  that  the  lower  bound  is  attained  for  all  the  esti¬ 
mates,  grand  mean  and  main  effects,  for  one  particular  expansible  series .  It  has 
also  been  demonstrated  that  a  much  wider  class  of  one-at-a-tlme  designs  has  the 

property  that  the  variances  of  the  main-effect  estimates  attain  the  lower  bound. 

* 

An  upper  bound  on  the  efficacy  of  such  designs  is  2/(n+l) . 


PKHMUTATIOH-IHVABIAMT  ISS2ESBS 
Properties  of  the  Designs 

A  class  of  designs  which  often  provides  convenient  expansible  series  is  the 
class  of  permutation- invariant  designs  defined  in  another  paper,  which  is  included 
as  Appendix  B.  For  2n  experiments  such  a  series  is  characterized  by  a  set  of  k 
integers  (a^,  *“  ,  m^},  where  for  convenience  the  will  be  arranged  in  non¬ 
decreasing  order .  For  n  i  factors  the  design  contains  those  (^  )  treatment 
ccmblnations  with  exactly  m^  factors  at  their  high  level,  plus  those 
binations  with  exactly  factors  at  their  high  Xml,  •  •  •  ,  plus  all  those 
combinations  with  exactly  m^  factors  at  their  high  level. 
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fluch  designs  have  the  property  of  treating  all  factors  alike .  That  is,  the 
variances  of  all  main  effects  are  equal,  the  variances  of  all  two- factor  interac¬ 
tions  are  equal,  and  covariances  between  analogous  pairs  of  estimates  are  equal. 
The  cross-product  matrices  have  a  particular  pattern  ifelch  allows  general  expres¬ 
sions  for  the  elements  of  the  inverse  to  be  obtained .  Consider  first  designs  for 
estimating  only  the  grand  mean  and  main  effects,  for  which  the  cross-product 
matrix  Is  as  follows: 
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A  A  A  A  A 


A 

A 

A 

A 

A 


a  B  B  3  B 
B  a  B  B  B 
B  B  a  B  B 
B  B  B  a  B 
B  B  B  B  a 


*  * 

Although  this  is  only  a  five-factor  example,  it  serves  to  illustrate  the  general 


case.  The  three  constants 


a.  A,  and  B  cure  respectively  the  maker  of  runs,  the 


inner  product  of  a  coefficient  vector  for  a  sain  effect  vith  one  for  the  seen,  and 


the  inner  product  of  coefficient  vectors  for  tvo  naln  effects. 


In  for  estimating  the  grand  mean,  main  effects,  and  tvo-f&ctor  inter¬ 
actions,  the  cross-product  Matrices  have  a  somewhat  acre  complex  structure,  which 
can  he  illustrated  by  a  four-factor  examples 


H 

A 

A_ 

A 

_A 

B 

B 

B 

B 

B 

_B 

A 

I 

B 

B 

B 

A 

A 

A 

C 

C 

C 

A 

B 

I 

B 

B 

A 

C 

C 

A 

A 

C 

A 

B 

B 

I 

B 

□ 

A 

C 

A  . 

C 

A 

A 

B 

B 

B 

_M 

1 

C 

A 

C 

A 

_A 

B 

A 

A 

C 

C 

X 

B 

B 

B 

B 

D 

B 

A 

C 

A 

C 

B 

X 

B 

B 

D 

B 

B 

A 

C 

C 

A 

B 

B 

I 

D 

B 

B 

B 

C 

A 

A 

C 

B 

B 

X) 

X 

B 

B 

B 

C 

A 

C 

A 

B 

B 

B 

B 

X 

B 

B 

c 

C 

A 

A 

D 

B 

B 

B 

B 

X 

Assigning  arable  numbers  to  the  factors  and  representing  interactions  by  a  pair  of 
numbers,  the  roam  and  columns  of  this  matrix  are  ordered  so  that  they  are  asso¬ 
ciated  with  the  mean,  factors  1  through  4,  and  interactions  in  the  order  12,  13, 

14,  23,  24,  34.  Here  X,  A,  and  B  are  defined  m  before  in  the  part  of  the  matrix 


9 


associated  with  the  Mtt  end  min  effect*  (the  first  five  rows  end  columns) .  Due 
to  the  structure  of  2  designs,  they  also  appear  elsewhere  1a  the  aatrlx.  A 
appease  as  the  laser  product  of  coefficient  vectors  for  the  main  effect  of  a 
factor  and  an  Interaction  containing  that  factor  (for  example,  1  with  12,  3  with 
23,  etc .) .  £  Is  the  Inner  product  of  coefficient  vectors  for  an  Interaction  and 
for  the  grand  man,  and  of  two  coefficient  vectors  for  Interactions  having  one 
factor  In  won  (e.g.,  12  with  13),  as  well.  The  constant  C  Is  defined  as  the 
Inner  product  of  coefficient  vectors  for  a  aaln  effect  of  one  factor  and  for  an 
Interaction  Involving  second  and  third  factors  (e  .g .,  2  with  13)  .  Finally,  D  Is 
the  Inner  product  of  coefficient  vectors  for  two  interactions  Involving  a  total  of 
four  factors  (13  with  24,  etc.). 

General  expressions  for  the  five  constants  are  as  follows  for  the  n- factor 
design  of  the  general  series  {jSj,  •••  ,  n^}: 

»  •  «$)•*<£,)<  <$)  *  <£> 

with  the  convention  that  when  a  binomial  coefficient 
q  <  0  or  p  <  q.  Its  value  is  zero.  The  derivation 
othexj  being  eonpletely  analogous. 


<£> . 

is  undefined  because 
for  B  will  be  given,  the 


IX) 


From  an  examination  of  the  coefficient  Matrix  X,  given  any  two  factor*, 
one  can  see  that  B  la  the  nuaber  of  occurrences  of  00  minus  twice  the  number 
of  occurrences  of  01  plus  the  nuaber  of  occurrences  of  11.  Considering  only 


those  treatment  combinations  with  m^  factors  at  their  high  levels,  there  are 
exactly  combinations  with  two  factors  fixed  at  levels  1  and  1,  because 


there  are  m ,  -2  more  l's  to  be  assigned  to  the  n-2 


lining  factors.  Simi¬ 


larly,  the  number  of  occurrences  of  0  and  1  for  two  fixed  factors  la  ^  , 

and  the  mmber  of  occurrences  of  0  and  C  la  (“”2  J.  Combining  these  and 
summing  over  1  gives  the  above  expression. 


Covariance  Matrices  of  Permutation-Invariant  Designs 

The  covariance  matrix  (the  Inverse  of  the  cross-product  matrix)  for 
permutation- Invariant  designs  also  exhibits  a  distinctive  pattern.  Tor  main- 
effect  designs  the  Inverse  baa  the  following  form,  again  illustrated  by  a  five- 
factor  example: 


The  constant  c*  times  o2  (the  constant  error-variance  of  the  errors  In  the 

Observations)  la  the  variance  of  the  estimate  of  the  mean,  yo2  la  the  variance 

2 

of  an  estimated  main  effect,  0o  la  the  covariance  between  the  estimated  swan  and 
an  estimated  effect,  and  8o2  is  the  covariance  between  the  estimates  of  two  main 
effects . 


U 


Because  of  the  patterned  structure  of  these  Matrices,  a  set  of  five  distinct 
equations  in  the  four  unknowns  Of,  0,  7,  and  6  can  he  obtained  by  formally  Multi¬ 
plying  the  cross-product  Matrix  by  the  covariance  Matrix.  For  the  general  case  of 
n  factors  the  equations  are  as  follows: 

■Of  ♦  n  A0  -  1 
AO  +  [K  +  (n-l)B]  0-0 
+  Ay  +  (n-l)AB  -  0 
A0  +  ly  +  (n-l)B5  -  1 
A0  +  By  +  Cl  +  (n-2)B]  6-0 

These  equations  are  consistent,  and  the  explicit  solutions  are  as  follows: 

O  «  [I  +  (n-l)B]/A 
0  -  -A/A 

>  -  [(n-l)(HB-A2)  +  I(M-B)]/(H-B)A 
B  «  -  (HB-A2)/(*-B)A  , 

where  &  is  given  by  A  *  n(lB-A2)  +  l(l-B)  . 

The  covariance  Matrix  for  the  estlMation  of  the  Mean,  aaln  effects,  and  tvo- 
f actor  Interactions  is  patterned  like  the  cross-product  Matrix  for  this  case*.  Its 
elements  will  be  denoted  by  snail  Latin  letters .  The  four-factor  example  is  as 
follows: 
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b 

b 

b 

Jb 

c 

c 

c 

_c 

c 

c 

b 

d 

e 

e 

e 

f 

f 

f 

g 

g 

g 

b 

e 

d 

e 

e 

f 

g 

g 

f 

f 

g 

b 

e 

e 

d 

e 

g 

f 

g 

f 

g 

f 

b_ 

e 

e 

e 

__d 

f 

g 

f 

g 

f 

_f 

c 

f 

f 

g 

g 

h 

1 

1 

i 

i 

Z 

c 

f 

g 

f 

g 

i 

h 

i 

i 

J 

i 

c 

f 

g 

g 

f 

i 

i 

h 

J 

i 

i 

c 

g 

f 

f 

g 

i 

i 

J 

h 

i 

i 

c 

g 

f 

g 

f 

i 

J 

1 

i 

h 

i 

c 

g 

g 

f 

f 

J 

i 

i 

i 

i 

h 

Again  the  Matrix  and  its  inverse  nay  be  formally  Multiplied,  to  obtain  a  set  of 
fourteen  consistent  equations  in  the  ten  unknowns.  Biese  equations  are  as  follows: 


Ha  +  nAb  +  (g)Bc  »  1 

Aa  +  [lh-(n-l)B]b  +[(n-l)A  +  (“g^Clc  »  0 
Ba  +  [2A+(n-2)c]b  +[»+2(n-2)B  +  (n“2)D]c  -  0 

Mb  +  Ad  +  (n-l)Ae  +  (n-l)Bf  +  (^jBg  «  0 

Ab  +  «d  +  (n-l)Be  +  (n-l)Af  +  (“g^Cg  *  1 

Ab  +  Bd  +  [*».(n-2)B]e  +  [A+(n-2)c]f  +  C(n-2)A  +  (“^Jclg  -  0 

Bb  +  Ad  +  [A*(n-2)C>  +  [Mf(n-2)B]f  +  [(n-2>B  +  (n“2)D]g,  «  0 

Bb  +  Cd  +  [2Af(n-3)C]e  +  [2B+(a-3)D]f  +  [M+2(b-3)B  +  (n“5)D]g  «  0 


Me  +  2Af  +  (n-2)Ag  +  Bh  +  2(a-2)Bi  +  -  0 
Ac  +  (»*B)f  +  (n-2)Bg  +  Ah  ♦  (n-2)(A^)i  +  (ng2)CJ  ”  0 
Ac  +  ZBf  +  [M*-(a-3)B]g  +  Ch  +  [2A+2(n-3)c]i  +  [(n-3)A+(n~5)c],J  *  0 
Be  +  2Af  +  (n-2)Cg  +  Hh  +  2(a-2)Bi  +  (n’2)DJ  «  1 
Be  +  (A+C)f  +  [A+(a-3)C]g  +  Bh  +  [Mf(n-2)&f(n-3)D]i  +  *  0 


Be  +  2Cf  +  [2A+(n-4)c]g+  Bh  +  [4B*2(n-4)D]i  +  [M+2(n-4)B+(n“lf)D]j  »  0 
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Attempt a  hare  teen  wade  to  obtain  explicit  solutions,  but  the  algebraic  complexity 
precludes  Obtaining  simple  expressions  for  the  Inverse  elements*  It  Is  far  easier 
to  evaluate  and  solve  the  above  equations  for  a  particular  ?ase,  and  a  s lapis 
computer  routine  has  been  written  to  do  this . 

HUME&ICAL  SXSULXS  OH  BBSCMtflQM-ITVK  DBSIEEKB 

numerical  results  have  been  Obtained  for  a  large  nuaber  of  resolution-five 
permutation- Invariant  designs,  and  these  are  given  In  Table  I.  The  table  lists 
the  series  (that  is,  the  m^'s);  number  of  factors;  the  constants  I,  A,  B,  C,  and 
D;  the  efficiency  of  the  estiaates  of  the  Mean,  sain  effects,  and  Interactions; 
the  prediction  index;  the  degree -of- freedom  efficiency;  and  the  efficacy.  An 
examination  of  the  table  reveals  that  the  best  series  In  terms  of  efficacies  of 
12»ose  tried  are  as  follows  for  various  values  of  the  nuaber  of  runs  guaranteed  to 
be  made. 


guaranteed  minimum 

best  series 

n 

efficacy  in  4> 

4 

to, 1,2) 

2 

100 

3 

57 

4 

23 

5 

IX) 

7 

(1,2,5) 

3 

57 

4 

52 

5 

29 

6 

14 

11 

{1,2,4) 

4 

67 

5 

69 

6 

50 

7 

28 

16 

(1,5,5) 

5 

100 

6 

69 

7 

45 

22 

(0,2,5) 

6 

89 

7 

58 

8 

35 

14 


TABLE  I 


EVALUATE®  OF  FBS(UTATIC®-ISVARIAIT  SE8DBSB 


Efficiency 


of 

of  ■sin 

of  Inter- 

d.f.  effi- 

effi- 

Series 

n 

V  A  B 

C 

mm 

effects 

actions 

P.I. 

ciency 

1I35H 

{0,1,2} 

2 

4  0  0 

0 

0 

100 

100 

100 

100 

100 

100 

5 

7  -1  -1 

-1 

-1 

57 

57 

57 

57-14 

100 

57 

4 

11  -3  -1 

1 

3 

10 

18 

36 

22 .86 

100 

23 

5 

16  -6  0 

2 

0 

2 

6 

25 

9.63 

100 

10 

6 

22  -10  2 

2 

-2 

1 

3 

18 

4.62 

100 

5 

{0,1,3} 

5 

16  -2  0 

-2 

0 

64 

45 

a 

56.69 

100 

57 

6 

27  -5  -1 

-1 

3 

9 

15 

45 

25.56 

81.48 

21 

7 

43  -11  -1 

1 

3 

2 

5 

31 

10.82 

67.44 

7 

rng 

-3 

3 

94 

65 

65 

67.29 

EX) 

67 

-1 

5 

49 

57 

58 

57.39 

16.19 

44 

2 

4 

13 

27 

46 

35.60 

61  Jl 

22 

5 

1 

36 

45 

69 

59*32 

43.26 

26 

{1,2,3} 

5 

7  1-1 

1 

-1 

57 

57 

57 

57 .14 

100 

57 

4 

14  0  -2 

0 

-2 

57 

91 

57 

66.17 

78,57 

52 

5 

25  -3  -3 

1 

1 

23 

49 

49 

45.91 

a, 00 

29 

6 

ia  -9  -3 

3 

1 

6 

IB 

40 

25 .25 

53.66 

14 

7 

63  -19  -1 

5 

-1 

2 

7 

33 

14.82 

46.03 

7 

{0,1,4} 

6 

22  0  2 

-4 

-2 

55 

54 

65 

61.54 

100 

62 

7 

43  -1  -1 

-5 

3 

62 

19 

52 

54.69 

67.44 

37 

8 

79  -7  -5 

-3 

7 

4 

6 

34 

11.29 

46.84 

5 

{0,2,4} 

5 

36  0  0 

0 

0 

100 

100 

100 

100 

100 

100 

6 

31  -1  -1 

-1 

-1 

94 

94 

94 

93  .a 

70.97 

67 

7 

57  -5  -3 

-1 

1 

47 

62 

80 

73.35 

50.88 

37 

8 

85  .9  .5 

1 

3 

57 

75 

a 

80.97 

43.53 

35 

{0,3,4} 

5 

16  4  0 

-4 

0 

a 

28 

45 

38.50 

100 

38 

6 

36  4  -4 

-4 

4 

74 

54 

54 

54.72 

61.11 

33 

7 

71  -1  -9 

-1 

7 

38 

56 

51 

51.33 

40.85 

21 

8  127  -15  -13 

5 

7 

12 

30 

44 

48.39 

29.13 

14 

{1,2,4} 

4 

11  -1  -1 

3 

3 

94 

65 

65 

67.29 

100 

67 

5 

20  -2  0 

2 

-4 

95 

91 

83 

86.23 

80.00 

69 

6 

36  -4  0 

0 

-4 

88 

86 

79 

81.05 

61.11 

50 

7 

6}  .9  -1 

-1 

-1 

37 

47 

71 

61.51 

46.03 

26 

8  106  -20  -2 

0 

2 

8 

20 

59 

37.04 

54.91 

13 

{1,3,4} 

5 

20  2  0 

-2 

-4 

95 

91 

83 

86.23 

80.00 

69 

6 

41  1  -3 

-3 

1 

81 

77 

83 

80.98 

53.66 

43 

7 

77  -5  -7 

-1 

5 

33 

52 

70 

62.27 

37.66 

23 

8  1^4  -20  -10 

4 

6 

10 

28 

57 

42.22 

27.61 

12 
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ZABLE  I  (Continued) 


Efficiency 


"oT~ 

or  nsin 

or  inter- 

> 

d.f .  effi- 

effi- 

Series 

n 

V 

A 

B  C 

effects 

notions 

P.I. 

cacy 

(2,3,0 

4 

11 

3 

-1  -1 

3 

10 

IB 

36 

22.86 

100 

23 

5 

25 

3 

-3  -1 

1 

23 

49 

49 

45.91 

64.00 

29 

6 

50 

0 

-6  0 

2 

23 

86 

49 

52.63 

44.00 

23 

7 

91 

-9 

-9  3 

3 

10 

45 

45 

42.22 

52.87 

14 

6  154 

-28 

-10  8 

2 

4 

17 

40 

26  J.9 

24.05 

6 

(0,1,5} 

7 

29 

3 

5  -5 

-3 

12 

52 

57 

49.40 

100 

49 

6 

65 

7 

1  -9 

1 

19 

19 

52 

36.91 

56.92 

21 

9 136 

6 

-8  -10 

8 

45 

6 

32 

17.18 

33.82 

6 

(0,2,5) 

6 

22 

-2 

2  1 

-2 

86 

92 

88 

89.21 

100 

89 

7 

43 

-1 

3  -1 

•5 

80 

97 

84 

86.57 

67.44 

58 

8 

85 

-1 

1  -5 

-3 

99 

74 

83 

81.50 

45.52 

35 

(0,3,5) 

6 

27 

3 

-1  -1 

3 

88 

86 

82 

83.57 

81.46 

68 

7 

57 

3 

-3  -1 

1 

86 

95 

95 

94.38 

50.87 

48 

(0,4,5) 

6 

22 

8 

2  -4 

-2 

35 

11 

31 

20.47 

100 

20 

7 

57 

13 

-3  “7 

1 

47 

26 

45 

38.45 

50.87 

20 

{1,2,5} 

5 

16 

-4 

0  4 

0 

64 

28 

45 

38.50 

100 

38 

6 

27 

-5 

3  3 

-5 

58 

74 

72 

72.11 

81.48 

59 

7 

49 

-5 

5 

-7 

61 

91 

74 

76.76 

59.18 

45 

6 

92 

-6 

4  -6 

-4 

91 

85 

78 

75.44 

40.21 

30 

(1,3,5) 

5 

16 

0 

0  0 

0 

100 

100 

100 

100 

100 

100 

6 

32 

0 

0  0 

0 

100 

100 

100 

100 

68.75 

69 

7 

63 

-1 

-1  -1 

-1 

99 

99 

99 

98.77 

46.03 

45 

(1,4,5) 

I  6 

27 

5 

3  -3 

-5 

59 

74 

72 

72  01 

81.48 

59 

7 

63 

9 

-1  -7 

-1 

8l 

85 

85 

79.09 

46.05 

36 

(2,3,5)  5 

21 

1 

-3  1 

5 

49 

57 

58 

57.29 

76.19 

44 

6 

41 

-1 

-3  3 

1 

81 

77 

83 

80 .92 

53.65 

43 

7 

77 

-5 

-3  3 

-3 

85 

93 

84 

85.78 

37.66 

32 

(2,4,5)  5 

16 

2 

0  2 

0 

64 

45 

64 

56.68 

100 

57 

6 

36 

4 

0  0 

-4 

88 

86 

79 

81.05 

61.11 

50 

7 

77 

5 

-3  -3 

-3 

85 

93 

84 

85.78 

57.66 

38 

(0,0,2,  3 

1  4 

12 

0 

0  -4 

4 

100 

62 

62 

63.78 

91.66 

58 

t0, 1,2,4 

5 

21 

-3 

1  1 

-3 

90 

90 

90 

90.30 

76.19 

69 

0,1, 3, 4 

l  5 

21 

1 

1  -3 

-3 

94 

88 

86 

91.39 

760.9 

70 

10,0,3,5 

1  6 

28 

2 

0  -2 

4 

97 

87 

80 

82.81 

78.57 

65 

7 

58 

2 

-2  -2 

2 

97 

96 

96 

95.87 

50.00 

48 

;o,i,4,5 

1  6 

28 

4 

4  -4 

-4 

57 

72 

72 

70.97 

78.57 

56 

,0,2,  3,  5 
10,2,4,5 

1  6 

1  5 

42 

17 

-2 

1 

-2  2 

1  1 

2 

1 

95 

97 

94 

97 

94 

97 

94.22 

9705 

52.38 

9401 

49 

91 

6 

37 

3 

1  -1 

-3 

95 

9 6 

95 

95.18 

59-45 

57 

26 


The  first  of  these  resolution-5  series,  JO,  1, 2 },  is  a  direct  amlogue  of 
the  class  of  oue-at-a-tlme  designs  for  the  resolution-^  case*  The  series  is  fully 
contractible,  and  each  design  contains  only  as  many  runs  as  there  are  parameters 
to  be  estiattted.  Like  cne-at-a-tijn  designs,  the  series  beccsMS  inefficient  very 
rapidly  as  '  be  number  of  factors  increases  * 

For  any  given  series,  the  efficiencies  tend  to  drop  monotcnloally  as  the 
nuniber  of  factors  increases,  as  one  night  expect .  It  is  noteworthy,  however,  that 
the  efficiency  of  the  seen  consistently  drops  much  more  rapidly  than  the  efficiency 
of  the  a&ln  effects,  which  in  turn  drops  store  rapidly  than  the  efficiency  of  the 
interactions .  Thus,  if  interactions  are  of  prinary  interest,  the  series  are 
particularly  attractive. 

Soars  interesting  irregularities  in  the  nonotonicity  of  the  efficiencies  as 
the  number  of  factors  Increases  asy  be  noted,  however.  For  example,  in  the  series 
{0,2,4}  the  efficiencies  of  the  mean,  aaln  effects,  and  interactions  all  decrease 
regularly  as  n  goes  free  5  to  6  to  7,  and  then  jump  up  as  n  increases  fren 
7  to  8;  similarly  they  Jusp  for  the  series  {0,2,5}  as  n  goes  from  6  to  7. 

There  seems  to  be  no  consistent  pattern  for  these  irregularities,  but  rather  they 
appear  to  be  an  artifact  of  the  particular  numerical  values  of  the  constants  asso¬ 
ciated  with  the  designs.  For  some  of  the  later  series  in  the  table,  for  which 
■  5,  there  is  a  tendency  for  the  efficiencies  is  measured  by  the  prediction 
index  to  Increase  and  then  peak.  When  this  Is  observed  the  dagr«e -of -freedom 
efficiency  is  very  low  at  the  peak,  so  that  these  series  do  not  appear  to  be  useful. 

For  some  fleaign*  it  appears  from  the  constants  A  through  0  that  the  effi¬ 
ciencies  can  be  improved  by  appending  a  run  at  the  treatment  combination  with  all 
factors  at  their  low  level.  This  procedure  adds  1  to  9,  B,  and  D  and  subtracts 
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1  txm  A  cad  C .  The  last  designs  in  the  table  vere  constructed  la  this  way,  sad 
for  most  of  these  selected  designs  the  prediction  Indices  did  la  fact  increase  to 
sons  degree,  the  value  of  the  extra  run,  as  Measured  by  the  efficacy,  was  only 
slight,  however,  except  for  one  design.  The  {2,4,5}  series  with  n  «  5  has  an 
efficacy  of  57 %  while  the  series  {0,2, 4, 5}  has  an  amazingly  higher  efficacy  of 
91*. 
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THE  H3UVATIQB  OF  EXPERIMENTAL  PBS  ISMS  BY  LINEAR  PROGRAMMING 

% 

GEHERAL  APPROACH 

Various  classes  of  incomplete  factorial  designs  can  be  characterised  as  solu¬ 
tions  to  sets  of  linear  constraints .  Papers  have  been  written  giving  the  charac¬ 
terisations  for  orthogonal  designs  (Appendix  A),  and  for  the  permutation-invariant 
and  clunipwise -orthogonal  classes  of  nan-orthogonal  designs  (Appendix  B) .  In  the 
case  of  orthogonal  designs,  the  characterization  is  somewhat  different  from  that 
given  by  Addeloan  [1].  An  investigation  of  the  differences  between  the  two  charac¬ 
terisations  has  revealed  that  designs  enjoy  two  optimality  properties  if  they  are 
orthogonal  in  the  sense  given  in  Appendix  A;  these  results  are  described  in 
Appendix  C. 

Given  the  number  of  factors  and  the  interactions  to  be  estimated,  the  linear 
constraints  characterising  the  class  of  designs  can  be  used  to  find  the  particular 
design  in  the  class  which  has  the  smallest  number  of  runs.  It  is  always  possible 
to  find  a  design  in  the  class  of  interest,  but  those  designs  which  are  readily 
obtainable  are  often  too  large  to  be  practical.  Starting  from  the  known  large 
design,  linear  programing  can  be  applied  in  order  to  obtain  the  smallest  design 
of  the  class .  This  procedure  is  illustrated  in  Appendix  A  for  the  family  of 

k 

orthogonal  2  resolution  3  designs,  where  the  half-replicate  is  obtained  starting 
from  the  full  factorial. 

In  order  to  study  the  linear-programming  technique  for  deriving  designs,  a 
special-purpose  linear-programing  computer  routine  has  been  written.  The  novel 
features  of  the  routine,  which  were  based  on  experience  with  particular  experi¬ 
mental-design  problems,  are  discussed  below.  Unlike  the  product-form  algorithm 
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in  general  use  for  solving  linear  programing  problems,  neither  the  basis  matrix 

nor  its  Inverse,  but  only  the  triangular  ized  basis,  is  available  at  any  Iteration. 

% 

Although  the  routine  has  been  used  only  for  experimental-design  problems,  it  can 

presumably  be  used  to  advantage  for  any  problem  of  a  combinatorial  nature  such  as 

.  * , 

assignment  problems,  transportation  problems,  and  scheduling  problems . 


IXSC3UFTXQB  OF  TEE  LINEAR  FROQRAMtlNGr  CGNPUOER  ROUTINE 
The  Simplex  Method 

A  brief  review  of  the  simplex  method  will  be  useful  in  order  to  introduce 
notation*  The  constraint  matrix  will  be  denoted  by  A,  and  its  number  of  rows 
and  columns  by  N  and  M,  respectively,  where  H  <  M*  The  constraints  are 
represented  by  the  matrix  equation  Av  ■  b,  where  v  is  the  M- component  vector 
of  variables  and  b  is  the  X- component  vector  of  right-hand  sides .  For  conven¬ 
ience  and  without  loss  of  generality  the  linear  form  to  be  minimized  will  be  taken 
to  be  simply  the  value  of  the  first  component  of  w,  which  will  be  denoted  by  X. 
Any  basis  selected  from  the  columns  of  A  will  be  designated  by  B .  The  cor- 

p ' 

responding  basic  solution  w  can  be  constructed  from  »  p  as  follows: 

if  column  Aj  appears  in  the  basis  as  column  Bj,  then  w^  *  pi;  if  column  Aj 
does  not  appear  in  the  basis,  then  w.  »  0. 


Suppose  the  solution  corresponding  to  a  particular  basis  B  is  feasible. 

Let  B^  *  Aj,  so  that  p^  «  w^,  and  let  0  *  B  ^A.  Any  column  of  A  can  be 
expressed  as  a  linear  combination  of  columns  of  B  in  the  form  Aj  ■  + 

«2J»2  +  "•  ♦  Suppose  we  wish  to  change  the  basis  by  introducing  say  Aj 

and  deleting  Br*  Expressing  B^  in  terms  of  the  new  basis,  we  have 

. B  -  fk±iUB  . f&lB  +-i_A 

r  «rj  ^  8„,  Br-1  \+X  g_,  ®H  +  g_,  Aj  ' 


rj 


®rj  J 
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aasu&ing  g  .  ^  0.  If  ve  substitute  this  value  for  B  in  the  equation 

g  p  " 

4i  *A  - b>  ve  have  E^r(pi“Pr  ♦  —■  At{  *  b.  Fran  this  equation  can 

be  determined  immediately  the  following  elements  of  the  solution  p*  corresponding 
to  the  new  basis:  p*  -  pA-pr  >,  for  i  4  r,  and  p*  -  p^/g^. 

»r  j  4 

*x 

Suppose  ve  vish  to  sake  the  choice  of  a  column  to  enter  ^the  basis  in 
such  a  way  that  1  does  not  increase .  Since  X  is  the  value  of  the  first  element 
of  the  solution,  it  changes  from  to  p?  ■  80  *****  ^  will  not 

increase  if  and  gyj  have  the  same  sign.  Ass  using  that  i  0  so  that  the 

original  solution  is  feasible,  suppose  ve  vish  to  choose  Br  in  such  a  way  that 
feasibility  is  maintained.  In  order  for  p*  to  be  positive  gr^  must  be  positive, 

“d  orier  for  **  to  ”*  p0*ltlv*'  r  ■UBt  be  Belected  ,o  «•* 

P ®  Is  the  minimum  value  of  those  p^/g^  for  which  is  positive. 


Thus,  if  J  is  chosen  so  that  is  positive,  and  if  r  is  chosen  so 

that  0  is  minimized,  then  the  solutions  associated  with  successive  bases  remain 
feasible  and  the  value  of  X  cannot  increase.  Thus,  the  process  may  be  continued 
iteratively  until  finally  no  column  can  be  found  to  enter  the  basis . 


Properties  of  Experimental  Design  Problems 

The  linear  programming  problems  for  finding  incomplete  factorial  designs  have 
several  distinctive  characteristics.  Obese  are  the  large  size  of  the  constraint 
matrices,  the  fact  that  the  matrices  consist  primarily  of  0's  and  -l's  (see 
Appendix  A),  and  the  fact  that  the  starting  basic  feasible  solutions  are  often 
highly  degenerate.  Due  to  the  degeneracy,  the  course  of  the  simplex  method  is 
typically  as  follows:  many  iterations  are  performed  with  no  change  in  X  because 
a  p  is  zero  for  which  g^  is  positive,  and  thus  6  m  0;  then  on  successive 
iterations  the  value  X  decreases  gradually,  with  the  associated  solutions  be.yig 
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nnodege  ttssrat* ;  finally  a  final  degenerate  solution  is  obtained,  but  many  more 
iterations  are  performed  before  the  condition  that  no  is  positive  occurs. 

Trlaagularlgation  in  the  Routine 

Execution  Is  initiated  with  the  matrix  A  stored  In  core  end  on  auxiliary 

tape,  with  the  first  H  columns  of  A  being  the  basis.  The  basis  matrix  is 

✓  .  **' 

triangular iaed  by  Gaussian  elimination  with  the  elimination  calculations  applied 
to  the  remaining  columns  of  A  and  to  b,  the  vector  of  right-hand  sides.  It 
will  be  helpful  in  the  sequel  to  go  through  'the  elimination  steps  for  a  simple 
example .  Suppose  the  original  matrix  is  denoted  by 


*11 

*12 

*13 

*14 

*15 

*21 

*22 

*23 

*24 

*25 

*51 

*32 

*33 

*34 

*35 

where  the  fourth  column  will  represent  a  column  of  A  not  In  the  basis  and  the 
fifth  column  will  represent  the  right-hand  sides .  After  one  step  the  matrix 
becomes 


*11 

*12 

0 

bl4 

*15 

*21 

*22 

0 

*24 

*31 

*32 

*33 

*3^ 

*55 

where  b^  *  a^  -  a^a^^/a^  asauaing  that  division  by  a^  is  possible  •  The 
■ext  step  yields  the  triangular  is  ed  system 


°JLL 

0 

0 

°14 

*15 

*21 

*22 

0 

*24 

b25 

*31 

*32 

*33 

*34 

*35 
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where  -  b^j  -  bi2b  2j/b22J  a0ain  that  division  by  t>22  is  possible. 

In  the  program  the  divisors,  or  pivots,  are  checked  to  see  that  they  are  larger  than 
a  tolerance  t^  (usually  10  or  10  J) .  If  not,  rows  are  interchanged  until  a 
large  enough  pivot  is  found.  If  none  can  be  found,  the  matrix  is  considered  singu¬ 
lar.  Because  the  matrices  have  as  elements  0's  and  -l's,  they  are  relatively 
veil  conditioned.  The  program  was  written  under  the  assumption  that  the  process  of 
picking  any  pivot  larger  than  the  tolerance,  rather  than  a  time-consuming  search  for 
the  largest  pivot,  would  be  satisfactory.  Some  difficulties  due  to  roundoff  have 
been  encountered  (see  below),  but  they  may  be  due  to  the  size  of  the  basis«matrices 
rather  than  an  inherent  weakness  in  this  method. 


Rules  for  Selecting  a  Column  to  Enter  the  Basis 


After  trlangularization,  the  selection  of  a  column  to  enter  the  basis  is  a 
relatively  simple  matter,  since  the  elements  g^  are  Just  ci  j/cn  •  Ordinarily 
there  will  be  a  number  of  columns  for  which  g^  is  positive,  and  three  rules 
have  been  tried  for  deciding  among  them.  These  are: 

RulelS  Select  tbe  first  column  (starting  vlth  )  -  M)  for  which 
is  positive. 

Rule  2:  Select  the  column  with  the  largest  positive  g^. 

Rule  3:  Select  a  column  for  which  6  >  0  if  one  exists,  otherwise  use 
Rule  2. 

An  experimental  Investigation  was  performed  in  order  to  evaluate  these  rules .  Two 

6  7 

representative  problems,  deriving  the  smallest  2  and  2'  resolution  3  designs, 


were  used  for  this  study.  The  constraint  matrices  for  these  problems  were  of  size 
25x66  and  30x150,  respectively.  Far  each  problem  ten  decks  were  constructed, 
vlth  the  columns  of  A  arranged  in  a  different  random  order  In  each  deck,  and  all 
three  rules  were  used  with  each  deck.  The  number  of  iterations  before  the  final 
solution  was  obtained  and  number  of  seconds  of  execution  time  were  as  follows: 
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KUHMIMQ  TIME  AKD  HUMBER  OF  ITERATIONS  FOR  THREE  SELECTION  RULES 


Problem 

26 

2 

,7 

Rule  1 

Rule 

2 

Rule  3 

Rule  1 

Rule  2 

Rule  3 

Deck 

sec. 

iter. 

sec .  iter . 

sec. 

iter. 

sec . 

iter. 

sec. 

iter. 

sec. 

iter. 

1 

39 

50 

9 

26 

14 

49 

269 

1075 

41 

66 

82 

164 

2 

15 

52 

8 

25 

10 

28 

53 

137 

56 

96 

80 

190 

3 

14 

69 

28 

81 

17 

61 

74 

217 

69 

165 

136 

311 

4 

11 

48 

10 

35 

12 

50 

198 

727 

117 

351 

159 

319 

5 

10 

39 

9 

25 

11 

36 

69 

217 

29O 

587 

99 

176 

6 

14 

71 

9 

32 

16 

80 

77 

307 

44 

77 

86 

168 

7 

14 

68 

9 

33 

12 

43 

122 

399 

88 

176 

267 

752 

8 

24 

161 

8 

28 

11 

36 

160 

605 

351* 

677 

125 

238 

9 

11 

59 

18 

26 

12 

44 

525* 

1630 

172 

401 

44 

66 

10 

10 

49 _ 

17  . 

29 

2?  , 

UJ  A 

61 

464 

jl 

60*  1 

. 

Averages 

IX 

12.5 

34 .0 

14  X 

54.6 

142 .8 

577.S 

127-5 

265 .6|  121 .3 

266  .< 

The  number  of  seconds  of  execution  is  the  ioportant  variable,  and  these  data  have 

been  analyzed.  As  a  check,  three  of  the  runs  were  duplicated,  and  the  running  time 

7 

was  exactly  the  same  twice  and  was  one  second  different  once .  For  two  of  the  2 
runs,  labeled  with  an  asterisk  in  the  above  table,  execution  was  stopped  before  a 
solution  had  been  readied,  and  the  time  at  which  this  occurred  was  entered  in  the 
table .  The  distribution  of  times  is  highly  skewed,  so  an  analysis  of  variance  was 
performed  on  the  double  logarithm  of  the  times.  This  analysis  revealed  that 
neither  the  decks  nor  the  rules  were  significantly  different  when  compared  with 
the  rule  by  deck  interaction.  Therefore,  for  these  two  problems  there  is  no  clear 
basis  for  choosing  between  the  three  rules.  For  a  particular  deck  there  is  often 
a  considerable  difference  between  the  three  rules,  however,  but  this  fact  is  of  no 
practical  use  since  for  an  actual  problem  the  properties  of  the  particular  arrange- 
meat  of  the  columns  are  unknown.  For  other  problems  than  the  2  or  2  it  may  some¬ 
times  be  possible  to  make  an  intuitive  choice  based  on  the  nature  of  the  rules . 

For  example,  if  the  number  of  columns  not  in  the  basis  is  small  compared  with  the 
number  which  are  in,  the  amount  of  additional  computation  required  for  applying 
Rule  2  is  comparatively  small,  and  one  might  wish  to  apply  that  rule. 
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Rules  for  Selecting  a  Column  to  Leave  the  Basis 

Once  a  column,  say  the  j-th,  has  been  selected  to  enter  the  basis,  a  column 
tails t  be  selected  to  leave.  Due  to  degeneracy,  the  value  of  d  is  usually  0, 
and  moreover  there  are  typically  several  columns  which  have  this  value  of  $. 

There  is  nothing  lost,  and  much  computational  time  saved,  by  selecting  the  column 
to  leave  with  the  lowest  order  number.  If  column  r  yields  the  value  0  ■  0, 
then  the  and  only  need  be  calculated  for  i  «  1,  ***  ,  r.  Moreover, 

when  the  new  column  is  inserted  into  the  basis,  the  basis  is  still  triangu¬ 
lar  ized  except  for  the  first  r  rows  and  columns,  which  produces  a  substantial 
saving  in  computer  time  over  triangularis ing  the  whole  basis.  Since  it  is  highly 
advantageous  to  pick  low  r's,  the  matrix  A  should  be  initially  arranged  so  that 
columns  of  B  corresponding  to  zero  elements  of  p  are  placed  in  the  basis  imme¬ 
diately  after  A^,  the  column  corresponding  to  1. 

The  column  which  has  been  selected  to  enter  the  basis  is  substituted  for  the 
column  to  leave,  and  the  latter  is  set  equal  to  zero.  In  the  course  of  applying 
the  simplex  method  it  may  be  necessary  for  a  column  which  has  left  the  basis  to 
reenter  at  a  later  iteration.  Because  of  this,  in  most  linear  programming  routines 
a  column  which  has  left  the  basis  is  immediately  updated  in  such  a  way  that  it 
enjoys  the  same  computational  status  as  if  it  had  never  been  in  the  basis.  Because 
of  the  size  of  the  constraint  matrices,  there  is  usually  a  wealth  of  columns  which 
can  enter  the  basis  at  any  iteration,  so  that  little  is  lost  by  deleting  the  columns 
which  have  been  removed.  When  it  is  determined  that  no  column  can  enter  the  basis, 
the  original  matrix  is  called  back  in  from  auxiliary  tape  storage,  so  that  the 
columns  which  had  been  deleted  can  become  available  for  reentry. 
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Detection  of  Roundoff  Error 


During  the  course  of  repeated  rc triangular izatioce  of  the  basis  matrix, 
roundoff  error  Inevitably  builds  iq>.  If  this  buildup  is  ignored,  singular  bases 
or  infeasible  solutions  may  be  obtained,  and  much  computer  time  lost  In  meaning- 
less  calculations .  Suppose  that  a  system  of  equations  has  been  triangular  ized, 
and  that  the  solutions  to  the  system  are  to  be  obtained  by  back  substitution.  An 
example  of  the  triangular  ized  system  vac  shovn  above  to  be  of  the  form 


CU 

0 

0 

c14 

c15 

b21 

b22 

0 

b24 

b25 

*31 

*32 

*33 

*34 

*35 

where  the  fifth  column  of  the  original  matrix  is  the  vector  of  right-hand  sides. 
We  wish  to  solve  for  p  -  (p^p^p^)'  such  that 

*1?2.  +  ^2*2  +  *3P3  *  *5  * 

where  the  Aj  are  the  original  columns  of  the  matrix.  From  the  triangularlzed 
system  we  have  immediately 

P1  ■  c15/oll 

p2  *  (b25  ‘  b21pl)/,b22 

p3  ’  (a35  *  a31pl  •  a32p2)/a33  ' 

Note  that  the  computation  of  involves  all  the  elements  which  have  previously 
been  computed,  so  that  this  element  is  the  most  sensitive  to  the  accumulation  of 
roundoff  error.  Once  again  the  degeneracy  of  the  solutions,  usually  considered  a 
disadvantage  in  linear  programming,  can  be  put  to  good  use.  Due  to  degeneracy, 
many  of  the  have  the  value  zero  except  for  roundoff  error,  which  can  be 
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expected  to  make  these  p,  small  positive  or  small  negative  numbers ,  Since  it  is 
not  possible  for  a  pi  to  have  a  negative  value  legitimately,  a  test  is  aade  of 
all  the  pi  which  are  calculated  at  a  given  iteration.  If  any  is  found,  which  is 
a  larger  negative  number  than  a  specified  tolerance  tg  (usually  from  -10  ~  to 
-10**5),  the  solution  is  considered  to  be  infeasible,  even  though  the  infees  lb  llity 
nay  be  due  solely  to  roundoff  error. 

If  a  solution  is  encountered  which  is  considered  to  be  infeasible,  or  if  a 
basis  aatrix  is  found  which  is  considered  singular,  an  attempt  is  nade  to  correct 
the  problem  automatically.  This  is  done  by  going  back  one  iteration,  calling  the 
constraint  natrix  back  from  tape  storage,  and  performing  a  new  complete  triangu¬ 
lar  isation  on  it.  On  nany  occasions  this  procedure  has  successfully  corrected  the 
difficulty.  If  the  natrix  is  found  to  be  singular  or  the  solution  infeasible  after 
going  back  one  iteration  and  starting  with  original  data,  the  problem  is  discon¬ 
tinued. 

CGMPUTATIOKAL  EEFKRIKHCE  WHS  TBX  BOUTUB 

67 

Besides  the  2  and  2*  problems  mentioned  earlier,  attests  to  determine  expli- 

A  q  5 

cit  2  ,  2  ,  and  y  designs  have  been  attempted  without  complete  success.  For  the 

Q  r 

2  and  y  the  value  of  1  decreased  to  the  minimum  theoretical  values  of  3  and  2, 
respectively,  but  the  solution  did  not  consist  of  integers.  This  contradicts  pre¬ 
vious  experience  noted  in  Appendix  A;  previously  no  non-integral  solution  in  which 
X  was  an  integer  had  been  obtained  for  an  experimental -design  problem. 

Q 

In  the  class  of  orthogonal  2  resolution- 3  designs  the  smallest  is  known  to  be 
the  twelve-run  Placke tt -Burma  design  [2].  Attempts  were  made  to  derive  this 
design  starting  from  the  twenty-run  Placke tt-Buxman  design,  for  which  X  *  5 .  The 
constraint  matrix  for  this  problem  is  of  size  38  x  238.  The  basis  matrix  was 


called  back  fro*  tape  and  re  triangular  ised  for  four  different  reasons  during  the 
course  of  tfri*  problem;  these  vere(a)&n  observed  singular  basis  matrix,  (b)an 
observed  infeaslbllity,  (c)no  column  could  enter  the  basis,  or (d)  100  iterations 
were  performed  since  the  last  callback.  The  complete  history  of  tills  problem. 
Involving  three  separate  computer  runs.  Is  as  follows. 


Callback  Iteration 


Comment 


0 

24 

1 

100 

2 

9fc 

3 

100 

4 

45 

5 

100 

6 

51 

7 

1 

7 

3 

7 

94 

8 

100 

9 

56 

10 

1 

10 

2 

10 

79 

11 

68 

12 

7 

12 

17 

13 

1 

13 

39 

14 

1 

14 

27 

13 

0 

0 

2 

0 

8 

1 

0 

0 

1 

1 

0 

singularity 

no  column  can  enter 

false  Infeaslbllity  (due  to  roundoff) 

true  Infeaslbllity 
still  infeasible,  1  ■  6.247 
Initial  solution  reobtalned  (X  «  5) 
false  infeaslbllity 

true  Infeaslbllity 

still  infeasible,  X  -  12 .000 

Initial  solution  reobtalned 

singularity 

false  Infeasibility 

first  decrease  In  X  to  X  ■  4  .578,  followed  by  10  consecutive 
changes 

false  infeaslbllity,  X  <*  3 .540 
X  ■  J  .540,  followed  by  38  consecutive  changes 
false  infeaslbllity,  X  ■>  2.881 
X  «  2.876,  followed  by  26  consecutive  changes 
false  infeaslbllity,  X  -  2 .576 
singularity  or  Infeaslbllity  after  callback 
XBf  RUB 

X  a  2 .580,  followed  by  6  consecutive  changes 
no  column  can  enter,  X  -  2 .517 
minimum  solution  readied,  X  >  2.5OO 
m  HUM,  start  at  X  «  3 .027,  require  X  a  3  .000 
X  m  3.000,  no  column  can  enter 
minimum  solution  reached,  X  ■  3. 000 


The  variables  w^  in  the  linear -grog  rawing  formulation  of  these  problems 
correspond  to  the  treatment  combinations  of  the  full  factorial.  The  numerical 
value  of  a  Is  the  number  of  tl».  the  correspond**  treatnent  combination  Is 
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included  in  the  design.  The  following  fine!  solution*  obtained  after  shout  1000 
iterations*  does  not  consist  of  integers*  hence  is  useless  as.  a  design. 


Treatment  Combination 

Value 

Treatment  Combination 

00000000 

1.000 

10010010 

.453 

00011110 

.720 

10011110 

.007 

00101001 

.466 

10100001 

.098 

00101101 

.058 

10100100 

025 

00110110 

J.72 

10100101 

-JL08 

00111101 

.548 

10100110 

.060 

00111110 

.035 

i 0101100 

.346 

01000011 

.407 

10110001 

.04? 

01000110 

.008 

10110011 

.936 

0  1  0  0  1 0  10 

.035 

11000000 

.058 

01001011 

.381 

110  OHIO 

.323 

0101010  1 

.449 

11001111 

.050 

011001 11 

.646 

4  1  10  10  10  0 

.315 

01110110 

.318 

11010101 

.309 

01111000 

.479 

11011001 

.666 

01111010 

.233 

11100100 

.490 

01111111 

.045 

11101010 

.555 

10001101 

.203 

11101101 

.466 

100  0  1111 

•536 

The  snallest  orthogonal  2^  resolution-}  design  hsa  not  as  yet  been  obtained* 
although  three  computer  runs  have  bean  attempted.  The  constraint  Matrix  is  of 
size  47  x  $14*  and  the  computations  were  initialized  with  tba  24-run  Jtlackett- 
Burma  design*  for  which  U6.  The  first  computer  run  meat  only  53  iterations 
with  two  callbacks  before  stopping.  The  second  went  only  10.  On  tip  third  run* 
the  value  of  X  decreased  from  6  to  4.709  on  the  74th  iteration*  Another  $ 
iterations  brought  the  value  down  to  X  *  $.460.  in  this  one  roundoff  error 
during  the  new  triangular izatica  built  up  to  aucb  a  degree  that  false  infeasibllily 
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The  Millet t  ksavn  orthogonal  3^  design  of  resolution  4  is  the  1/3  replicate 
Involving  8l  runs .  it  follows  fraa  the  results  of  impend  lx  A  that  the  number  of 
runs  la  such  a  design  must  he  a  multiple  of  27,  and  it  is  known  that  27  runs  is 
insufficient*  It  appears,  therefore,  that  there  should  exist  a  54-run  design, 
ttie  constraint  matrix  is  of  sire  I32  *  245,  but  due  to  its  sice,  the  portion  of 
the  matrix  in  core  at  agy  one  tine  is  132  *  189,  the  rm  inde  1  being  on  auxiliary 
tape  storage.  Six  computer  runs  were  performed,  involving  381  iterations  and  27 
callbacks,  each  of  which  stopped  when  singularity  or  infeasibility  was  observed 


1— cd  lately  after  a  callback .  In  between  runs  columns  were  rearranged  or  the 
tolerances  and  t^  defined  above  were  changed  in  order  that  the  problem  would 
not  hilt  as  it  had  during  the  previous  run.  During  the  seventh  computer  run, 
involving  120  Iterations  and  6  call  barks,  the  value  of  X  decreased  to  2.075, 


and  during  the  eig^tti  xun  the  value  X  «  2.000  was  obtained  for  the  following 
solution,; 


Solution  to  3^  Problem— Treatment  Combinations 
and  the  Corresponding  Values  of  the  Solution 


00000 

1.000 

01112 

.247 

02210 

.409 

10221 

.270 

12100 

.521 

20211 

.330 

00012 

.318 

01122 

.469 

02211 

048 

10222 

•312 

12102 

.341 

20220 

.734 

00021 

•637 

Q1200 

.443 

02220 

.317 

11001 

.451 

12111 

.583 

21000 

•734 

00022 

.045 

01201 

.283 

02221 

.334 

11002 

.419 

12120 

.231 

21010 

.303 

00102 

.683 

01202 

067 

10001 

.413 

11010 

.461 

12122 

.323 

21X512 

.230 

00110 

.016 

01210 

.258 

10002 

.055 

non 

.084 

12201 

.475 

21021 

.734 

00111 

.482 

01220 

046 

1X5010 

.307 

11022 

.585 

1221X5 

.277 

211X52 

.633 

00120 

.820 

01221 

•353 

10011 

.095 

moo 

.615 

12212 

•633 

21111 

.633 

00201 

.091 

01222 

•351 

1X5012 

.545 

11102 

.040 

12221 

.308 

21120 

-734 

00202 

.226 

02001 

.604 

10020 

.262 

11112 

.548 

12222 

•306 

21201 

.633 

00210 

.164 

02010 

.498 

10021 

.323 

11121 

.734 

20001 

•532 

21210 

.230 

00211 

.790 

02011 

.028 

1X5100 

.037 

11122 

.064 

20010 

.431 

21212 

.605 

00212 

.230 

020 12 

•233 

10101 

.446 

11200 

.130 

20012 

.303 

21222 

•532 

00222 

.499 

02022 

.637 

1X5110 

.596 

11202 

.3^5 

20022 

•734 

22002 

•734 

01002 

.596 

0210T 

•216 

1X5112 

.274 

11210 

.176 

20100 

.532 

22011 

.734 

01011 

•551 

02101 

.389 

10121 

.238 

11211 

.731 

201X52 

.303 

22020 

.532 

01012 

*371 

02110 

.082 

10122 

.41X5 

11220 

.618 

20111 

.303 

22101 

•532 

01020 

.502 

02112 

.601 

1X5200 

.431 

12000 

.266 

20112 

.330 

22110 

.734 

01021 

•179 

02120 

.215 

1X5201 

.215 

12002 

.396 

20121 

.532 

22122 

•734 

01100 

1 

.079 

02121 

.496 

1X5202 

.403 

12011 

.507 

20201 

.303 

22200 

.754 

01101 

•633 

02200 

.262 

1X5201 

083 

12020 

.704 

20202 

.330 

22212 

.532 

01110 

•572 

02202 

.529 

1X5220 

.185 

12021 

027 

20210 

.303 

22221 

•734 

Attempts  were  wade  to  obtain  a  solution  by  holding  X  fixed  and 
an  arbitrarily  selected  component  of  the  solution  vector,  but  they  did  not  succeed 
in  Obtaining  an  Integral  solution.  One  approach,  as  yet  untried,  vhick  nay  be 
useful  Is  to  alnlaise  the  sub  of  an  arbitrary  selection  of  the  v^. 

One  special  run  was  conducted  on  the  y,  during  which  running  tines  and 
information  on  the  progress  of  the  routine  were  printed  rather  than  the  usual 


output.  The  BaxlnuB  amber  of  iterations  between  callbacks  was  set  at  30.  A  total 
of  15  callbacks  was  wade,  aany  of  which  ran  the  full  30  iterations .  The  aaount  of 


tine  required  for  a  complete  triangular  isation  of  the  2JZ  x  LB9  ays  taw  averaged 
15  seconds .  It  took  about  3  seconds  to  call  the  watrix  back  from  tape .  For  the 
first  29  iterations,  *©  till  the  first  callback,  the  cclum  ambers  of  the  colmna 
which  left  the  basis  were  as  follows:  -  “  -  ^  - 

2  x  21  . 15  5,  9  '  .  ■ 

3  2  4  30  6  4 

3  6  23  13  3  10 

4  14  6  40  8  2 

2  18  2  4  8 

FOr  the  largest  re  triangular  lotion,  Involving  the  first  40  rows  and  coluwns,  the 
tlae  was  53/6°  of  one  second.  The  smiles t  retrlangularlsatlons  were  perforaed 
In  1/60  of  one  second.  This  illustrates  the  advantage  of  allowing  the  earliest 
possible  coluwn  to  leave  the  basis.  It  should  be  noted  that, the  nwxjnt  of  tine 
taken  for  all  computations  other  thin  retrlangularl  sat  ions  and  reading  In  ffo» 
tspe  is  negligible.  The  total  execution  tine  for 
puter  was  6  minutes  and  40  seconds . 


>1 


-hhntiffh  experience  with  the  routine  hu  not  been  entirely  successful,  it  has 
demonstrated  the  general  feasibility  of  the  techniques  used.  In  particular,  the 
use  of  the  triangularlzed  Matrix  rather  than  the  basis  and  its  inverse  appears  to 
be  advantageous .  The  Method  used  to  eliminate  roundoff  error  works  quite  well,  as 
is  evidenced  by  the  number  of  times  false  Infeasibilities  were  detected  and  cor¬ 
rected.  Hie  number  of  iterations  after  callback  at  which  false  infeasibility  is 
detected  varies  considerably  from  callback  to  callback.  Therefore,  a  constant 
callback  frequency  would  not  appear  to  be  effective  •  It  is  interesting  to  note 
that  more  success  was  had  with  the  y,  with  constraint  matrix  of  order  132,  than 
with  the  2^,  whose  constraint  matrix  is  only  of  order  47,  at  least  in  the  sense 
that  more  iterations  per  computer  run  were  performed.  It  is  gratifying  that  a 
problem  as  large  as  the  y*  could  he  hard  led  with  even  partial  success,  but  it  is 
disappointing  that  the  2^  turned  out  to  be  so  difficult  a  problem. 

It  is  unlikely  that  linear  programming  will  prove  to  be  a  useful  technique  for 
deriving  new  practical  incomplete  factorial  designs  due  to  the  sizes  of  the  con¬ 
straint  matrices  involved,  the  number  of  iterations  required  for  solutions,  and  the 
problem  of  non- integral  solutions.  It  does  appear,  however,  that  it  may  be  possible 
to  use  the  1  constraints  for  these  problems  to  direct  a  systematic  trial -end- 
error  search  for  designs  on  a  computer.  It  is  hoped  that  the  special  methods 
incorporated  into  the  computer  routine  will  have  more  general  usefulness  with  other 
types  of  linear  programming  problems . 
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SUMMARY 

A  necessary  and  sufficient  condition  is  derived  for  an  incomplete 
factorial  design  to  be  orthogonal  under  a  very  vide  class  of  param¬ 
eterization*  .  She  condition  is  that  for  every  subset  of  t  factors, 

where  t  is  twice  the  number  of  factors  in  the  highest-order  inter - 

ft: 

action  to  be  estimated,  each  combination  of  levels  occurs  the  same 
number  of  times.  Shis  condition  is  an  extension  of  one  due  to 
Plackett  (1946) .  from  this  condition  can  be  derived  a  set  of  linear 
constraints  on  the  number  of  occurrences  of  each  treatment  combination. 
The  problem  of  finding  the  smallest  orthogonal  design  to  fit  a  given 
experimental  situation  is  therefore  an  integer  linear  programming 
problem. 

It  Is  shown  that  designs  can  be  derived  using  ordinary  linear 
programing  algorithms  with  a  few  special  devices,  rather  than  the 
more  complicated  integer  linear  programming  techniques .  As  an 

k 

example,  a  one-half  replicate  of  the  2  is  derived  in  detail. 

It  is  desirable  to  have  available  from  an  experiment  an  estimate 
of  experimental  error  obtained  from  duplicate  runs .  Such  an  estimate 
is  unbiased  by  high-order  interactions .  It  is  shown  how  the  linear 
programming  technique  can  be  modified  to  find  the  smallest  orthogonal 
design  containing  at  least  one  duplicated  run. 


1.  IHTRODUCnOW 


In  this  paper  we  consider  the  general  class  of  orthogonal  incom¬ 
plete  factorial  designs .  Any  design  which  does  not  contain  the  full 
factorial  will  he  considered  as  an  incomplete  factorial  design,  so 
that  the  class  includes  designs  having  more  than  one  run  at  some  of 
the  treatment  combinations.  The  class  of  orthogonal  incomplete 
factorial  designs  contains  as  a  subclass  the  familiar  fractional 
factorials,  for  which  the  treatment  combinations  at  the  runs  to  be 
made  form  a  subgroup  or  coset  under  a  suitably  defined  group  operation. 

The  definitions  of  such  fundamental  concepts  as  the  parameteriza¬ 
tion  underlying  the  analysis  of  factorial  designs,  aliasing  in  incom¬ 
plete  designs,  and  the  resolution  of  a  design  are  often  defined  In 
terms  which  apply  only  to  fractional  factorials.  Since  the  discussion 
will  not  be  limited  to  such  designs,  definition*  which  apply  to  any 
incomplete  factorial  design  will  be  formulated. 

Parameterization 

The  parameters  for  the  model  assumed  in  a  factorial-design  situ¬ 
ation  are  often  defined  in  terms  of  their  estimators  in  the  full 
factorial.  For  example,  the  main  effect  of  a  two-level  factor  may  be 
defined  as  one-half  the  difference  of  the  average  response  at  the  high 
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level  and  the  average  response  at  the  low  level  of  the  factor  (see,  for 
example,  Yates,  1935)  •  For  our  purposes  it  is  more  convenient  to 
define  parameters  directly  in  terms  of  the  expected  values  of  the 
responses.  For  notational  simplicity  ve  will  define  the  parameters 
for  a  design  containing  only  four  factors .  We  will  use  sr  to  desig¬ 
nate  the  number  of  levels  of  the  r-th  factor,  the  Integers 
0,  *  *  *  ,  s^-1  to  designate  the  levels  of  each  factor,  and  a  set  of 
integers  in  juxtaposition  to  designate  a  treatment  combination .  Let 
the  expected  value  of  the  response  for  rune  at  treatment  combination 
ijkX  be  denoted  by  A  dot  replacing  a  subscript  will  mean 

that  the  expected  value  has  been  computed  over  all  values  for  that 
subscript.  The  grand  mean  is  defined  to  be  ^  .  The  main-effect 

parameters  for  the  first  factor  are  a  set  of  s^-1  linearly  inde¬ 
pendent  contrasts  (linear  combinations  the  sum  of  whose  coefficients 
is  zero)  among  the  ,  and  main  effects  for  the  other  factors 

are  defined  analogously.  The  e^-1  contrasts  are  usually  taken  to 
be  orthogonal  as  well  as  linearly  independent.  The  two- factor 
interaction  parameters  for  the  first  two  factors  are  a  set  of 
(s. — l)(s„-l)  contrasts  among  the  quantities  u. .  ,  and  these 

«  *  j  *  * 

contrasts  are  usually  taken  to  be  orthogonal  to  each  other  as  well  as 
to  the  contrasts  defining  main  effects.  The  definitions  of  the  inter¬ 
action  parameters  among  other  pairs  of  factors  are  analogous . 


Similarly  three-  and  four-factor  Interactions  can  be  defined  as  con¬ 
trasts  certain  of  the  If  4  Is  the  vector  of 

and  if  p*  is  the  vector  of  Ifer  parameters  vhich  have  been  defined, 
ve  may  write  p*  *  or  \x  *  Q*p*.  The  matrix  Q*  is  in  fact 

yini^g ingniRr*  from  the  given  definition  of  p*  in  terms  of  u» 

The  effect  of  ignoring  hlgh-order  interactions 

Experience  indicates  that  in  most  factorial  situations  the  high- 
order  interactions  are  so  small  that  their  estimates  from  an  experiment 
are  not  likely  to  be  statistically  significantly  different  from  zero. 

If  the  high-order  interactions  are  ignored  (treated  as  if  they  were 
zero)  the  model  can  be  rewritten  n  “  C JP,  where  p  is  a  vector 
containing  those  components  of  p*  corresponding  to  non- ignored 
parameters,  and  where  Q  contains  a  corresponding  selection  of  the 
columns  of  Q* .  If  many  components  of  P*  can  be  ignored,  it  may  be 
advantageous  to  run  an  incomplete  factorial  design. 

Let  X  be  a  vector  of  observations  from  an  incomplete  factorial 
design,  and  suppose  EX  can  be  expressed  as  EX  »  XP  under  the 
assumption  that  ignored  parameters  are  zero,  where  X  contains  a 
selection  of  the  rows  of  Q.  Each  column  of  X  is  sosoclated  with 
one  component  of  p,  and  the  column  associated  with  a  parameter  will 
be  called  the  coefficient  vector  for  that  parameter . 
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If  the  cross  -product  matrix  X'X  is  nonsingular,  then 
6  «  (X'X)_1X'Y  is  the  least-squares  estimate  of  p.  If  X'X  is 
nonsingul&r,  we  will  say  that  $  is  measurable  ( Conner  and  Zelen, 

1959.  p.  2) .  Hote  that  ve  hare  mot  actually  assumed  that  p  Is  estimable. 
When  ignored  parameters  hare  the  value  zero,  measurability  implies  estima- 
bllity.  The  expected  value  of  $  is  (X'X)~\'X^*  (Box  and  Wilson, 

1951,  Appendix  b),  where  X*  consists  of  rows  of  so  that  the 

expected  value  of  a  component  $q  of  0  Is  p^  plus  a  linear  ccabl  ition 
of  ignored  parameters .  This  effect  is  called  aliasing  (not  confounding) 
and  Pq  la  said  to  be  all&sed  with  the  linear  combination  of  Ignored 
parameters .  Confounding  is  a  special  case  of  aliasing  In  which  block 
contrasts  are  aliased  with  ignored  interaction  parameters . 

The  concept  of  the  resolution  of  a  design,  introduced  by  Box  and 
Hunter  (1961),  will  prove  useful,  and  a  slightly  modified  definition 
is  as  follows.  If  a  design  is  such  that  all  parameters  involving  r 
or  fewer  factors  are  measurable  ignoring  all  interactions  of  r+1  or 
more  factors,  the  design  is  said  to  be  of  resolution  2r+l;  if  all 
parameters  involving  r-1  or  fewer  factors  are  measurable  ignoring 
all  interactions  of  r+1  or  more  factors,  the  design  is  said  to  be 
of  resolution  2r . 
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Conditions  for  optimality  of  orthogonality 

It  has  often  been  stated  that  orthogonal  designs  yield  the  maxi¬ 
mum  possible  information  per  run  for  each  parameter  estimate .  The 
actual  theorem  v&s  first  proved  by  Plackett  and  Buxman  (1946)  •  If  a 
class  of  designs  is  considered  for  which  the  lengths  of  the  column 
vectors  of  the  matrix  X  are  fixed,  and  if  there  is  an  orthogonal 
design  In  this  class,  then  the  variances  of  all  estimates  are  Indi¬ 
vidually  minimized  over  the  class  using  the  orthogonal  design. 

The  condition  that  the  lengths  of  the  columns  be  fixed  does  more 
than  fix  the  scale  of  measurement,  however.  If  the  parameterization 
is  fixed  it  often  happens  that,  for  some  of  the  parameters,  nan- 
orthogonal  designs  have  coefficient  vectors  of  greater  length  than 
those  for  orthogonal  designs  with  the  same  number  of  runs .  It  can 
also  happen  that  nan-orthogonal  designs  have  lower  variances  for  some 
of  the  estimates  than  orthogonal  designs  with  the  same  number  of  runs 
as  in  the  following  example. 

Consider  an  ejqperimental  situation  with  a  single  factor  at  three 
levels.  Let  Mqj  Uj,  and  be  the  expected  value  of  the  response  at 
the  three  levels,  and  let  the  parameters  to  be  estimated  be  the  grand 
mean,  (m^  +  M^  +  Mg)/3#  the  "linear  effect",  (Mg  -  Mq)/2,  and  the 
"quadratic  effect",  (^  -  +  Mg)/6.  The  design  consisting  of 
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seven  rum  at  each  level  is  orthogonal,  and  the  variances  of  the  esti¬ 
mates  are  a2  times  1/21,  l/l4,  and  T/294,  respectively.  The  design 
consisting  of  eight  runs  at  levels  0  and  2  and  five  runs  at  level 
1  is  non- orthogonal  and  has  variances  o2  times  l/20,  l/l6,  and 
7/240.  The  fact  that  non-orthogonal  designs  may  be  superior  in  this 
sense  to  orthogonal  ones  has  apparently  never  been  utilized  in  the 
construction  of  incomplete  factorial  designs. 

j* 

Although  orthogonal  designs  are  not  in  general  optimum  In  the 
sense  of  minimizing  all  the  variances,  experience  indicates  that  they 
are  rarely  far  from  optimum.  In  the  important  case  of  factors  at  two 
levels,  orthogonal  designs  are  optimum  because  all  coefficient  vectors 
consist  of  plus  and  minus  ones,  so  that  the  lengths  of  the  columns  of 
X  are  automatically  fixed.  It  has  previously  been  shown  (Webb,  1964) 
that  for  most  parameter! nations,  orthogonal  designs  satisfy  other 
criteria  for  optimality  (see  Section  2  below) .  In  addition,  they 
afford  great  computational  advantages,  and  therefore  there  is  still 
reason  to  be  very  much  concerned  with  methods  for  constructing  orthog¬ 
onal  designs. 

In  a  subsequent  paper  it  will  be  shown  how  Hurr  constraints  can  be 
used  to  characterize  classes  of  non-orthogonal  designs. 
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CHABACTERIZATIOB  OF  OBTHO£K8dIJTX 

Tensor  product  parameterizations 

A  design  is  orthogonal  if  X'X  is  diagonal,  so  that  orthogonal¬ 
ity  of  a  design  depends  on  the  parameterization  used  to  describe  the 
response.  A  characterization  of  orthogonality  will  j%  derived  first 
for  a  special  class  of  parameterizations  which  will  be  referred  to  as 
tensor  product  parameterizations  and  then  extended  tOA  larger  class 
of  parameterizations  •  ,,  . 

Let  U  arti  V  be  m  and  n  dimensional  vector  spaces  with 
bases  u^,  “*  i  um  a&d  ,  vn,  respectively.  Let  w^  be 

an  mn  dimensional  vector  whose  components  are  the  an  possible 
products  taken  in  a  given  order  of  a  component  of  with  a  compo¬ 
nent  of  Vj.  As  i  goes  from  1  to  a  and  j  goes  from  1  to  a, 
the  on  vectors  which  are  produced  in  this  manner  are  linearly  inde¬ 
pendent.  Let  V  be  the  vector  space  spanned  by  the  v^.  Then  V 
is  called  the  tensor  product  of  the  vector  spaces  and  V,  and  the 

basis  {v^}  is  called  the  product  basis  of  the  component  bases 


the  basis  {w^}  is  an  orthogonal  basis  for  the  tensor  product. 
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Sippoee  that  In  an  experiment  involving  n  factors  the  min 
effects  of  the  i-tfc  factor  are  defined  as  orthogonal  contrasts  ma^ng 
the  ejected  values  averaged  over  the  levels  of  the  other  factors . 

Ihe  coefficient  vectors  of  these  s^-l  contrasts*  together  vith  the 
s^ccqponcnt  vector  all  of  whose  components  are  the  constant  l/s^, 
form  an  orthogonal  basis  for  a  vector  space  V^.  Let  the  two-factor 
Interaction  parameters  between  the  1-th  and  j-th  factors  be 
defined  by  all  contrasts  (except  those  which  define  nain  effects  and 
the  grand  mean)  whose  coefficient  vectors  are  In  the  product  basis  of 
the  tensor  product  of  and  .  The  contrasts  are  between  the 
expected  responses  averaged  over  the  levels  of  the  remaining  n-2 
factors.  Similarly,  three-factor  through  n-factor  interactions  are 
defined  In  terms  of  the  tensor  product  of  sets  of  the  V^. 

Since  the  basis  for  each  Is  orthogonal,  the  product  bases 
for  tensor  products  of  sets  of  the  are  also  orthogonal .  There- 
fore,  the  rows  of  the  matrix  Q#~  appearing  in  the  relationship 
£*  *  axe  mutually  orthogonal.  A  column  of  Q*  Is  therefore 

a  multiple  of  the  corresponding  row  of  Q*"1.  Since  the  components 
Of  the  row  of  defining  the  grand  mean  are  all  equal  to  the 

constant  s^,  and  since  this  matrix  Is  of  order  1^,  s^, 

the  coefficient  vector  for  the  grand  mean  In  any  (complete  or  Incom¬ 
plete)  factorial  design  has  all  its  components  equal  to  unity*  Dus  to 


the  tensor  product  parameterization,  a  component  of  the  coefficient 
vector  for  an  interaction  parameter  is  the  product  of  the  corresponding 
components  of  the  coefficient  vectors  for  main  effects  of  factors 
making  up  the  interaction* 

It  should  be  noted  that  such  parameterizations  are  not  new,  In 
particular,  the  equivalent  of  tensor  product  parameterizations  mas 
used  by  Placket t  (1946,  p.  3$0)  in  deriving  conditions  for  orthogonal¬ 
ity,  although  he  did  not  use  OUT  present 

The  characterization 

•■MMMnMBManMMlWWMMMMI 

Placket t  (1946)  vas  concerned  with  the  situation  in  which  the 
main  effects  of  a  number  of  factors  are  to  be  estimated  together  with 
the  interaction  between  any  two  of  them.  He  found  that  in  order  that 
the  interaction  parameters  between  A  and  B  be  estimated  orthog¬ 
onally,  all  combinations  of  levels  of  A  and  B  must  appear  equally 
often  with  the  levels  of  every  other  factor.  In  this  case  the  inter¬ 
actions  between  A  and  every  other  factor,  and  between  B  and  any 
other  factor,  msy  slao  be  estimated  orthogonally.  In  the  present 
paper  we  are  concerned  with  the  situation  in  which  all  interactions 
of  a  given  order  are  to  be  estimated .  It  is  seen,  therefore,  that 
Blackett's  result  does  not  apply.  Our  gen  ?el  characterization  is 
given  in  Theorem  4,  the  first  three  theorems  giving 
results. 


to 
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OBEOBBW  1.  If  in  a  given  design  the  i-th  factor  appears  at  s^ 
levels,  the  coefficient  vectors  for  tJH  its  main-effect  parameters  are 
orthogonal  to  the  coefficient  vector  for  the  grand  mean  if  and  only  if 
every  level  of  the  factor  appears  the  same  number  of  times. 

PKXJF.  Let  the  components  of  the  s^caaponent  vector  z  be  the 
numbers  of  occurrences  of  each  level  of  the  factor  in  the  design.  The 
inner  product  of  the  coefficient  vector  for  a  main-effect  parameter 
and  the  coefficient  vector  for  the  grand  mean  is  equal  to  a  non-zero 
scalar  times  the  inner  product  of  z  and  the  vector  in  which 
defines  the  main  effect .  The  vector  z  is  orthogonal  to  all  the 
vectors  in  defining  main  effects  if  and  only  if  z  is  a  multiple 
of  the  remaining  basis  vector,  namely  the  vector  of  constants .  There¬ 
fore,  all  components  of  z  are  equal.  This  concludes  the  jppof  • 

i  ":v~*We  now  ask  when  all  the  coefficient  vectors  for  main  effects  of 
Coe  factor  are  orthogonal  to  all  the  coefficient  vectors  for  main 
effects  of  another  factor.  Because  the  tensor  product  parameteriza¬ 
tion  is  being  used,  the  inner  product  of  two  coefficient  vectors,  one 
for  a  main  effect  of  one  factor  and  the  other  for  a  main  effect  of  the 
other,  is  numerically  the  seme  as  the  inner  product  of  the  coefficient 
vectors  for  one  of  the  two-factor  interaction  parameters  and  the  grand 
mean.  We  will  now  prove  that  this  observation  is  true  more  generally. 


THEOREM  2  *  All  coefficient  vectors  associated  with  the  m-factor 
interactions  among  factors  Ap  •••  ,  A^  are  orthogonal  to  those  for 
the  n-factor  interactions  among  factors  Bp  •  •  •  ,  Bn  if  and  only  if 
all  coefficient  vectors  for  the  (m+n)  -factor  interactions  are  orthog¬ 
onal  to  the  coefficient  vector  for  the  grand  mean . 

PROOF.  Because  of  the  say  interactions  were  defined,  there  is  an 
(mn)  -factor  interaction  parameter  whose  coefficient  vector  has  as 
components  the  product  of  coefficients  for  any  given  "A-lnteraetion” 
and  any  given  "B-interaction" .  Therefore,  the  inner  product  of  coef¬ 
ficient  vectors  for  an  A- interaction  and  a  B-interaction  is  numer¬ 
ically  equal  to  the  inner  product  of  coefficient  vectors  for  an 
"AB- interaction"  and  the  grand  mean. 

Theorem  1  can  now  be  generalized  to  include  orthogonal  designs 
for  estimating  all  the  effects  and  int^^tions  among  several  factop. 

THEOREM  5 .  Given  v  factors  Ap  ,  Ay,  the  coefficient 
vectors  for  the  grand  mean,  main  effects,  and  interactions  of  all 
orders  are  mutually  orthogonal  if  and  only  if  all  possible  cqihinatioas 
of  levels  appear  the  same  number  of  times . 

PROOF.  If  every  combination  appears  the  same  number  of  times 
then  the  design  is  a  replicated  full  factorial  in  the  v  factors 
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•••  ,  A^,  so  that  in  particular  the  design  is  orthogonal .  Hew 
suppose  all  the  coefficient  vectors  are  orthogonal .  In  particular, 
all  the  coefficient  vectors  for  main  effects  and  interactions  are 
orthogonal  to  that  for  the  mean.  In  the  space  which  is  the  tensor 
product  of  V1  through  let  z  be  the  vector  whose  components 

specify  how  many  times  each  treatment  combination  appears  in  the  design. 
Any  given  effect  or  interaction  parameter  is  defined  in  terms  of  some 
vector,  call  it  b,  which  is  also  in  the  tensor  product.  Then  the 
inner  product  of  the  coefficient  vector  for  the  given  parameter  with 
the  coefficient  vector  for  the  grand  mean  is  a  scalar  times  z'b.  The 
vector  z  is  required  to  be  orthogonal  to  all  such  b;  therefore  z 
must  again  have  all  its  components  equal,  which  concludes  the  proof. 

In  view  of  Theorems  Z  and  3,  we  have  the  following  immediate 
corollary,  which  serves  to  characterize  orthogonal  resolution  ttl 
designs. 

ThEGKEM  4.  A  design  of  resolution  t+i  is  orthogonal  under  any 
parameterization  derived  from  a  complete  tensor  product  parameteriza¬ 
tion  by  Ignoring  interactions  involving  £(t+l)  or  more  factors  if 
and  only  if  for  every  subset  of  t  factors  all  possible  combinations 
of  levels  appear  the  same  number  of  times . 


PROOF.  If  t  Is  even,  all  coefficient  vectors  for  parameters 
Involving  it  factors  are  required  to  be  orthogonal  to  coefficient 
vectors  for  other  parameters  Involving  it  or  fever  factors .  By 
Theorem  2,  this  happens  if  and  only  if  all  coefficient  vectors  for 
parameters  Involving  t  or  fever  factors  are  orthogonal  to  the  coef¬ 
ficient  vector  for  the  grand  mean.  By  lbeorem  3,  this  is  equivalent 
to  the  requirement  that  for  every  subset  of  t  factors  all  possible 
combinations  of  levels  appear  the  same  maaber  of  times.  If  t  Is 
odd,  all  coefficient  vectors  for  parameters  involving  i(t-l)  factors 
are  required  to  be  orthogonal  to  coefficient  vectors  for  other  param¬ 
eters  involving  i(Ul)  or  fever  factors,  which  again  Is  the  case  if 
and  only  if  all  coefficient  vectors  for  parameters  involving  t  or 
fever  factors  are  orthogonal  to  the  coefficient  vector  for  the  grand 
mean.  The  remainder  of  the  proof  proceeds  as  when  t  is  even. 

Alternative  paraaaeterizatlons 

Theorem  4  remains  true  under  a  vide  variety  of  parameter  lotions . 
In  fact,  ve  can  state  the  following: 

THEOREM  3.  If  0  is  a  vector  of  parameters  which  are  defined  by 
Ignoring  interactions  of  r+l  or  more  factors  from  a  complete  tensor 
product  parameterization  0*,  if  a  is  a  linear  transform  of  0, 
say  a  -  C  and  if  the  full  factorial  design  is  orthogonal  under 


the  p&zmeterization  tt,  then  Theorem  4  is  valid  for  the  parameteri¬ 
zation  05. 

PROOF.  Let  Q  and  X  be  the  coefficient  matrices  under  the  param¬ 
eterization  J3  for  the  fall  factorial  design  aM  for  any  other  orthog¬ 
onal  design  with  say  If  runs,  respectively.  Then  the  conditions  of 
the  theorem  imply  that  Q'Q,  X'X,  and  C'Q'QC  are  diagonal  matrices. 

But  since  X  and  Q  represent  designs  orthogonal  under  the  same 
tensor  product  parameterization,  X'X  must  be  equal  to  the  constant 
H/H^  times  Q'Q.  Therefore,  C'X'XC  is  also  diagonal,  as  was  to 
be  proved. 

Recently,  a  theorem  has  been  published  by  Addelman  (1962)  which 
states  that  a  design  of  resolution  3  is  orthogonal  if  and  only  if  for 
every  pair  of  factors  the  number  of  occurrences  of  the  conib ination  of 
levels  ij  is  given  by  n . .  *  n .  n  ,/n.  Here  n.  and  n  .  are  the 

ij  i*  i*  *j 

numbers  of  occurrences  of  level  i  of  the  first  factor  and  level  J 
of  the  second  factor,  respectively.  Such  designs  are  called 
proportional- frequency  designs.  The  apparent  contradiction  between 
Addelman*  s  theorem  and  Theorem  4  above  stems  from  a  different  defini¬ 
tion  of  grand  mean.  Hie  grand  mean  is  the  expected  value  of  the 
average  of  the  observations  made,  while  the  one  us,ed  above  is  the 
average  of  the  expected  values  at  all  possible  treatment  combinations. 


Although  the  first  published  proof  of  this  theorem  is  apparently  that 
of  Addelman  (Addelman  and  IDampthome,  1961),  its  validity  was 
apparently  recognized  by  Stevens  (1943) .  Using  the  present  definition 
of  the  grand  mean,  but  defining  main  effects  by  linear  combinations  of 
the  n.  other  than  orthogonal  contrasts,  Plackett  (194-6)  showed 
that  the  condition  of  proportional  frequencies  is  necessary,  but  not 
sufficient,  for  a  design  to  be  orthogonal. 

The  question  naturally  arises  as  to  which  definition  of  grand 
mean  is  preferable.  If  it  can  be  supposed  that  the  mean  is  only  a 
nuisance  parameter,  as  for  example  in  a  screening  experiment, 

Addelman* s  definition  would  likely  be  the  more  useful.  If,  however, 
the  purpose  of  the  experiment  is  to  describe  the  response  over  the 
points  of  the  full  factorial,  then  the  present  definition  seems  more 
appropriate  as  can  be  seen  by  the  following  argument. 

Since  we  are  supposing  that  all  points  are  of  interest,  the 
criterion  for  judging  designs  should  depend  on  the  variances  of  pre¬ 
dicted  values  at  all  treatment  combinations  of  the  full  factorial. 
Since  these  variances  approach  zero  as  the  number  of  runs  increases, 
the  criterion  should  be  adjusted  in  seme  way  for  the  number  of  runs,  N. 
A  convenient  and  realistic  criterion  is  therefore  the  average  variance 
of  predicted  values  multiplied  by  N.  Under  this  criterion  the  best 
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main-effect  designs  are  those  which  are  orthogonal  using  the  present 
definition  of  the  grand  mean,  as  the  following  theorem  demonstrates . 

The  theorem  was  given  previously  (Webb,  1964),  and  is  reproduced  here 
for  convenience. 

THEOREM  6.  In  a  design  for  estimating  parameters  under  any  main- 
effect  parameterization,  the  average  variance  of  a  predicted  value 
times  the  number  of  runs  is  minimized  if  and  only  if  the  design  is  an 
equal-frequency  design.  In  this  case  the  variances  of  all,  predicted 
values  are  equal. 

PROOF.  Let  X  be  the  coefficient  matrix  of  a  design  under  book:  main' 
effect  parameterization .  Let  Z  be  the  coefficient  matrix  of  the  full 
factorial.  Let  X  contain  N  rows,  Z  contain  M  rows,  and  let 
there  be  p  parameters .  The  value  of  the  criterion  being  considered 
is  (a2N/M)trace  Z(X*X)  ^Z1,  where  cr2  is  the  error  variance  of  an 
observation . 

If  EY  »  Xp  under  the  given  parameterization  and  A  is  any 
nonsingular  matrix,  then  under  the  parameterization  Of  =  A-1p  the 
covariance  matrix  of  predicted  values  is  ^ZACA'X'XAJ^A'Z',  which 
equals  o2Z(X'X)  ^Z*  .  Therefore  the  choice  of  parameterization  is 
arbitrary.  Without  loss  of  generality  we  will  choose  a  tensor  product 
parameterization  in  which  the  main  effects  are  scaled  so  that  Z'Z  is 
equal  to  the  constant  M  times  the  identity  matrix. 
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Now  choose  any  factor  at  say  s  levels  and  consider  columns  of 
Z  corresponding  to  the  s-l  main  effects  of  the  given  factor.  By 
rearranging  the  rows  of  Z,  these  columns  can  be  expressed  as  an 
s  by  s-l  matrix  S  repeated  vertically  M/s  times .  Letting  i  be 
a  vector  containing  s  ones,  we  observe  now  that  the  matrix 
l//s  •  [i,S]  is  orthonormal .  It  follows  that  the  sum  of  squares  of 
any  row  (or  column)  is  equal  to  unity.  Therefore  the  sum  of  squares 
of  any  row  of  S  is  equal  to  s-l.  This  argument  may  be  repeated  for 
each  factor,  so  that  the  sum  of  squares  of  any  row  of  Z  is  equal  to 
p,  the  number  of  parameters .  Since  the  matrix  X  contains  a  selec¬ 
tion  of  the  rows  of  Z,  the  sum  of  squares  of  any  row  of  X  is  also 
P- 


Returning  to  the  average  variance  of  a  predicted  value,  note  that 
a2/!*  tr  Z^'X)’3^'  «  a2/M  tr  (X’xrVz  *  er2  tr  (X'X)"1.  But  the  last 
expression,  is  equal  to  c2  •  l/X^,  where  the  1^  are  the  eigen¬ 

values  of  X'X.  However,  the  **um  T.  X i«  fixed  since 
£  =  tr  X'X  «  tr  XX'  »  N  •  p  (the  sum  of  squares  of  any  row  of  X 

is  equal  to  p)  .  If  the  sum  of  p  positive  numbers  if  fixed,  the  sum 
of  the  reciprocals  is  minimized  if  the  numbers  are  all  equal.  The 
eigenvalues  of  a  matrix  are  equal  if  and  only  if  the  matrix  is  a 
constant  times  the  identity.  Therefore  the  average  variance  of  a  pre¬ 
dicted  value  is  minimized  if  and  only  if  X'X  is  equal  to  N  times 
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the  Identity  and.  then  the  average  variance  is  o  p/N  •  Since  a  tensor 

product  parameterization  vas  used,  Theorem  4  applies  and  the  criterion 

2 

attains  its  minimum  value  a  p  whenever  for  each  subset  of  two  factors 
each  combination  of  levels  appears  the  same  number  of  times.  Further¬ 
more  this  result  is  independent  of  the  parameterization  used  and  of  the 
number  of  runs.  If  X'X  is  equal  to  a  constant  times  the  identity 
matrix  it  is  easy  to  see  that  the  diagonal  elements  of  Z(XiX)~1Zl  are 
all  equal.  This  completes  the  proof  of  Theorem  6. 


3.  CONSTRUCTING  DESIGNS  BY  LINEAR  PROGRAMMING 


Conversion  to  a  linear  programming  problem 

In  this  section  a  detailed  method  for  recharacterizing  an  orthog¬ 
onal  design  as  an  Integer  solution  to  a  linear  programming  problem  is 
presented.  First,  consider  the  special  case  of  symmetric  designs,  in 
which  all  n  factors  are  at  s  levels.  Theorem  4  implies  that,  for 
any  t  factors,  each  combination  of  levels  occurs  the  same  number  of 
times,  X.  This  number  is  called  the  index  of  the  design  (Bose  and 
Bush,  1952),  and  the  number  of  runs,  N,  in  the  design  is  given  by 
N  =  Xs  .  Let  J q  be  a  collection  of  ail  treatment  combinations  which 
are  identical  in  t  components,  say  the  first  t.  There  will  be 
s  vectors  in  JQ.  Let  w^  be  the  numbei  of  occurrences  of 
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treatment  combination,  j  in  an  orthogonal  resolution  t+1  design. 

Then  Theorem  4  requires  that  E.  _  v.  be  equal  to  X.  But  the  first 

J 

+ 

t  components  may  be  fixed  at  any  of  s  sets  of  values,  each  of  which 

yields  a  constraint  E.  _w  a  X.  This  argument  may  be  repeated  for  all 

jej  j 

(£)  subsets  of  t  factors,  so  that  in  all  s^  X  (£)  constraints  may 
be  written  down. 

The  analogous  results  are  more  complicated  for  asymmetric  designs, 
in  which  the  n  factors  appear  at  Sp  s 2,  •  •  •  ,  s^  levels,  respec¬ 
tively.  For  convenience,  we  first  extend  the  definition  of  index  to 
apply  to  asymmetric  designs  as  well. 

Let  1^  be  a  subset  of  t  of  the  integers  1  through  n,  and 

let  the  collection  C 1^ { ye T 3  contain  every  such  subset.  The  index 

class  T  will  be  the  integers  1,  2,  ••*,(£).  Let  K  be  the  least 

common  multiple  of  the  numbers  11^-  s^,  as  y  ranges  over  T.  The 

7 

number  of  runs  in  an  orthogonal  resolution  t+1  design  must  be  a 
multiple  K,  say  N  =  XK.  This  number  X  may  be  defined  as  the 
index  of  the  design. 

Suppose  is  one  of  the  H.^  s^^  collections  of  all  those 

treatment  combinations  j  which  have  fixed  values  for  the  set  of 
components  whose  ordinal  numbers  are  in  the  set  Iv.  Then  Theorem  4 
fcTOUee  EJeJ  v3  = 


KX/n^i  sr  As  7  ranges  through  f,  a  set  of 
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constraints  is  generated  which  must  be  satisfied  by  any  orthogonal 
design.  This  set  i3  linearly  dependent.  Two  methods  for  reducing  the 
complete  set  to  an  equivalent  set  of  linearly  independent  constraints 
are  described  in  the  Appendix.  The  constraint  matrices  given  in  the 
examples  below  were  constructed  by  using  one  of  these  methods  (the 
"base-set  method")  • 

Eliminating  unacceptable  solutions 

The  problem  of  finding  the  orthogonal  design  with  the  smallest 
value  of  N  for  a  given  set  of  factors  and  given  resolution  is  now 
equivalent  to  finding  a  set  of  non-negative  integer  values  for  the 
variables  w^  and  X  such  that  the  set  of  constraints  summarized  by 

the  problem  is  stated  in  this  fora,  then  w^  *  0,  X  a  0  provides  a 
trivial  solution.  To  rule  out  this  solution,  the  additional  require¬ 
ment  that  X  be  positive  can  be  added.  A  convenient  way  to  include 
this  requirement  is  to  add  a  new  non-negative  variable  X*  and  the 
constraint  X~X*  «  1. 

The  full  factorial  is  orthogonal  and  hence  satisfies  the  con¬ 
straints  with  each  variable  w^  equal  to  unity  and  the  number  of  runs 
equal  to  By  multiplying  each  variable  by  the  fraction  K/  IX  s 

another  unacceptable  solution  is  obtained  with  X  equal  to  one.  It 


KX/n^i  s^  is  satisfied  and  such  that  X  is  minimized.  If 


follows  from  Theorem  4  that  an  orthogonal  design  remains  orthogonal  If 
the  designations  of  the  levels  of  any  faetor  are  rearranged.  Since  a 
design  must  have  f  t  least  one  run  at  some  treatment  combination,  we 
may  require  that  an  arbitrary  one  of  the  v^,  say  .  q,  be  greater 
than  or  equal  to  one .  The  addition  to  the  problem  of  a  non-negative 
variable  v*  _  and  the  constraint  v~  ~  -  v?  ^  *  1  will  rule 

out  the  unacceptable  solution  corresponding  to  X  »  1. 

Non- integer  solutions  may  still  exist,  but  in  the  examples  worked 
to  date,  the  use  of  the  simplex  method  for  linear  programming  together 
with  a  simple  algorithmic  device  has  led  to  the  smallest  orthogonal 
designs.  The  device  is  as  follows:  if  the  minimum  solution  to  the 
problem  obtain*  i  by  the  simplex  method  is  such  that  X  is  not  an 
integer,  change  the  problem  by  requiring  that  X  be  greater  than  or 
equal  to  the  next  larger  integer.  This  can  be  done  by  revising  the 
constraint  X-X*  =  1,  rather  than  by  adding  a  new  constraint,  so  that 
some  of  the  previous  feasible  solutions  may  still  be  feasible  solutions 
to  the  revised  problem.  Thus,  it  is  not  necessary  to  start  the  linear 
programming  procedure  over  from  the  beginning . 

Numerical  example 

The  following  example  is  intended  only  to  illustrate  the  steps 
involved  in  constructing  designs  by  linear  programming,  the  resulting 
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design  being  a  familiar  fractional  factorial.  In  Section  4  some 
designs  will  be  derived  which  are  not  fractional  factorials,  but  their 
construction  does  not  illustrate  the  general  procedure.  It  will  be 
assumed  that  the  reader  is  familiar  with  the  basic  simplex  method  as 
described  in  any  linear  programming  text,  for  example,  Hadley  (1962)  . 

k 

We  will  derive  the  smallest  design  of  resolution  3  for  a  2  exper¬ 
imental  situation.  For  convenience  we  abandon  the  vector  subscripts  on 
the  Wj  and  substitute  numerical  subscripts.  Let  w^  be  the  number 
of  occurrences  of  that  treatment  combination  which  is  the  binary  expan¬ 
sion  of  the  decimal  number  j .  Thus  w^  is  the  number  of  occurrences 
of  OCXX),  w1  is  the  number  of  occurrences  of  0001,  and  so  forth. 

The  independent  linear  constraints  (obtained  by  using  the  base-set 
method  described  in  the  Appendix)  are  given  by  Aw  *  b,  where 
w  «  X*,  v£,  wQ,  wx,  •••  ,  v15)',  b  *  (1,  1,  0,  •  ••  ,  0)',  and  A 

is  the  following  matrix  (a  dot  is  used  for  zero) . 
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In  order  to  start  the  simplex  method,  an  initial  basic  feasible 
solution  is  required.  The  solution  corresponding  to  the  full  factorial 
design  is  X  =  k,  X*  =  3*  w*  =  0,  and  =  1  for  j  =  0,  1,  •  •  •  ,  15* 

This  solution  is  feasible  but  not  basic. 

Denoting  the  columns  of  the  matrix  A  by  A,  A*,  A*,  Aq,  Ap  y 
A^_,  and  substituting  the  full -factorial  solution  into  tie  equation 
Aw  e  b,  we  obtain  the  equation  k-A  +  3A*  +  A^  »  b .  An  obvious 

linear  relationship  among  the  columns  of  A  is 

A  +  A*  +  A^  +  A,.  +  A^  -  A^  +  Aq  +  A^j-  =  0.  Subtracting  this  equation 
from  the  previous  one  yields  the  equation 

3A+2A*+A0+A1+A2+A^+2A^+A9-i-A10+A11+A12+A15+A1^  *  b,  to  which  corresponds 
the  solution  X  *=  3,  X*  a  2,  =  2,  w^  =  1  for  J  a  0,  1,  2,  k,  9,  10, 

11,  12,  13,  and  14,  and  the  remaining  variables  zero.  It  can  be 
verified  that  the  columns  with  non-zero  coefficients  in  the  last  equa¬ 
tion  are  linearly  independent .  Moreover,  there  are  exactly  thirteen 
such  columns,  the  same  as  the  number  of  rows,  that  this  solution  is 
basic  and  feasible.  For  other  designs  a  set  of  linearly  independent 
columns  of  the  constraint  matrix  can  be  obtained  by  using  the  solution 
corresponding  to  a  known  fractional  factorial.  It  may  then  be  necessary 
to  adjoin  additional  columns  in  order  to  have  a  full  basis. 

Returning  to  the  present  example,  let  B  be  the  basis  matrix 
consisting  of  the  chosen  set  of  linearly  independent  columns,  that  is, 
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B  «  [A,  A  ,  Aq,  A^  kg,  A^,  A+j,  A ^  A^q,  A^  •) ,  A A A^].  ^ 
applying  the  simplex  method  ve  first  compute  G  *  B~^A  and  a  B-1b . 
Those  columns  of  G  which  are  not  columns  of  the  identity  matrix  and 
the  vector  v_  are  as  follows : 


Since  the  linear  form  to  be  minimized  is  just  the  first  variable 
A,  a  column  of  A  should  be  introduced  into  the  basis  only  if  the 
first  component  of  the  corresponding  column  of  G  is  positive,  for 
only  then  could  A  decrease.  The  column  A^  will  be  chosen  to  enter. 
In  order  to  maintain  feasibility,  the  column  chosen  to  leave  the  basis 
must  be  such  that  for  >  0  (g^  is  the  i-th  component  of 

column  G^)  the  ratio  v^/g^  is  minimized.  The  smallest  value  is 
1  which  is  attained  for  i  *  4,  5,  10,  and  11.  If  we  allow  A-^  to 
leave  the  basis  (for  which  i  =  4)  and  recompute  G  and  wfi  using 
the  revised  basis  matrix,  ve  obtain: 
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[Gj,G1,G5,G6,G8,G15]  = 


-2 

-1 

0 

1 

-1 

-1 

-  1 

2 

-2 

-1 

0 

1 

-1 

-1 

1 

-1 

0 

0 

0 

0 

0 

1 

-1 

1 

1 

0 

-1 

-1 

1 

0 

-1 

-1 

1 

0 

0 

0 

-1 

0 

1 

1 

-1 

1 

-1 

0 

0 

0 

0 

0 

,  WP  * 

1 

-1 

0 

0 

1 

0 

-1 

n 

1 

-1 

0 

1 

0 

0 

-1 

1 

0 

-1 

-1 

0 

0 

1 

0 

0 

-1 

-1 

0 

1 

0 

0 

-1 

0 

1 

0 

-1 

0 

1 

-1 

0 

0 

1 

-1 

0 

1 

The  solution  now  corresponds  to  one  of  the  half -replicates  of  a 

4 

2  design.  Since  G^  is  the  only  one  of  these  vectors  whose  first 
component  is  positive,  is  the  only  vector  whose  admission  into  the 

basis  could  make  1  smaller.  For  the  fifth  row  the  ratio  ±/ei6 
has  the  value  zero,  so  that  the  corresponding  column  of  A,  namely 
A g,  is  the  only  column  which  can  leave.  Since  the  minimum  value  is 
zero,  the  solution  does  not  change.  The  basis  matrix  is  now 
B  **  [A,  A  ,  Aq,  A^,  A^,  Aj^,  Ay,  A^,  A^q,  A^  ^ ,  A^g,  A^y  A^J. 

The  columns  of  G  of  interest  are  now  the  following: 
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[gJ,g1,g2,g5,g8,g15] 


-2  0-11  -1 

-1  0-1  1  -1 

-1  0  o  0  0 

-110  1-1 
0-1  1-1  0 

-1  1-1  2  -1 

-10  0  10 

-11-110 
-10  0  10 

0-1  0-1  0 

0-1  0-1  1 

-10  0  1-1 

-11-11  -1 


-1 

2 

-1 

1 

0 

1 

-1 

1 

0 

0 

-1 

1 

0 

-1 

W  ai 

'  B 

1 

1 

-1 

1 

1 

0 

0 

0 

0 

1 

c 

1 

L.  j 

The  vector  can  be  Introduced  Into  the  basis,  and  the  minimum 

I> 

value  of  the  ratio  is  &  bo  that  must  leave  the  basis . 

The  new  solution  is  v  =  ( l£,  1, 0, 0, 0,  \}  0,  h,  £,  0,  \}  £,  h  t,  0)  • 

By  recoorputation  of  G  it  may  be  verified  that  no  new  column  can  come 
into  the  basis,  so  that  the  present  solution  is  a  minimum  basic  feasi¬ 
ble  solution.  It  does  not  consist  of  integers,  however,  and  therefore 
does  not  correspond  to  an  actual  design.  Nevertheless,  ve  now  Know 
that  there  can  be  no  design  with  1-1,  so  the  smallest  design  must 
have  X  i  2.  We  have  incidently  obtained  a  solution  with  A,  »  2, 
however,  so  that  the  half-replicate  corresponding  to  that  solution  is 

one  of  the  class  of  smallest  orthogonal  designs,  as  are  the  other 

k 

known  half -replicates  of  a  2  design. 
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k .  ALTERNATIVE  ORTHOGONAL  DESIGNS  AND  PARTIAL  DUPLICATION 


The  technique  of  viewing  an  experimental  design  as  a  solution  to 
a  set  of  linear  constraints  lends  itself  naturally  to  the  investigation 
of  possible  alternative  designs.  Consider  a  simple  experiment  involving 
two  factors  at.  two  levels  and  one  at  three  levels,  and  suppose  that  only 
the  grand  mean  and  main  effects  are  to  be  estimated .  For  this  experi¬ 
mental  situation  the  constant  K  is  equal  to  twelve,  and  therefore  an 
orthogonal  design  must  contain  121  runs,  where  X  is  the  index.  The 
full  factorial  is  orthogonal  and  involves  exactly  twelve  runs,  and 
therefore  is  a  smallest  orthogonal  resolution  5  design  for  this  situa¬ 
tion.  One  may  ask  whether  there  are  any  other  orthogonal  designs  with 
Just  twelve  runs. 

Let  the  third  component  of  the  treatment  combination  vector  repre¬ 
sent  the  level  of  the  three-level  factor.  The  linear  constraints  may 

be  written  in  a  form  expressing  each  w  as  a  linear  combination  of 

J 

X,  w0,  and  w^,  where  and  are  the  numbers  of  occurrences  of 

treatment  combinations  000  and  001.  These  linear  combinations 
appear  in  the  second  column  of  the  following  table . 


Treatment 

Combination 


Variable 


Solutions 


000 

vo 

l 

l 

2 

001 

wi 

l 

0 

1 

002 

V 

2 

c 

JA-Wq-Wi 

1 

2 

0 

010 

V 

3 

m 

2X-v0 

1 

1 

0 

Oil 

i 

- 

2A-v1 

1 

2 

1 

012 

V 

.  5 

= 

V0+V1‘A 

1 

0 

2 

100 

v6 

sc 

2X-vq 

1 

1 

101 

W7 

2l' 

2A-v^ 

l 

2 

1 

102 

v8 

u 

V0+V1A 

l 

0 

2 

no 

- 

v9 

as 

w0 

1 

1 

2 

111 

v10 

- 

W1 

1 

0 

1 

112 

wn 

a 

3A-v0-Wi 

1 

2 

0 

X 

i 

mm 

■a 

The  full  factorial  corresponds  to  taking  wQ  »  1,  w^  =  1,  and 
A  m  1,  and  this  is  the  first  of  the  three  solutions  given  in  the 
table . 


If  vQ  and  arenas signed  values  it  is  possible  to  construct 

a  design  with  these  values  if  A  is  chosen  properly.  It  seem6 
plausible  that  if  the  value  of  is  decreased  to  zero,  then  the 

value  of  A  would  not  have  to  increase.  This  is  indeed  the  case,  and 
the  corresponding  solution  is  the  second  one  given  in  the  above  table. 
The  structure  of  this  design  is  interesting.  If  the  0  level  of  the 
third  factor  is  ignored,  the  design  is  a  half-replicate  of  a  2  run 
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twice.  Also  given  in  the  above  table  is  a  third  design,  which  is  a 
repeated  half -replicate  of  ^  2  if  level  1  of  the  third  variable  is 
ignored. 

There  are  several  alternative  designs  available  in  many  experi- 

2 

mental  situations .  In  the  case  of  a  2*3  situation  there  are,  in 
addition  to  the  full  factorial,  orthogonal  resolution  3  designs  with 
the  same  number  of  runs  in  which  four  treatment  combinations  are  dupli¬ 
cated  and  others  in  which  six  treatment  combinations  are  duplicated . 

There  are  two  basic  reasons  for  the  selection  of  alternative 
designs .  The  first  is  that  sometimes  certain  sets  of  experimental 
conditions  can  not  be  attained,  and  there  may  be  an  alternative  design 
which  does  not  include  the  taboo  treatment  combinations .  The  second 
reason  is  that  alternative  designs  often  provide  partial  duplication. 

The  argument  for  partial  duplication  has  been  given  by  Daniel 
(1957)  and  by  Dykstra  (1959) •  Briefly,  it  is  this;  if  partial  dupli¬ 
cation  is  present,  an  error  estimate  is  available  which  is  unaffected 
by  the  presence  of  high-order  interactions .  Dykstra  gave  a  catalogue 
of  partially  duplicated  designs,  some  of  which  are  due  to  Daniel.  The 
designs  catalogued  are  constructed  by  combining  pairs  of  fractional 
factorial  designs,  and  are  all  non-orthogonal . 
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The  linear  programming  procedure  for  orthogonal  designs  can  be 
modified  to  obtain  a  procedure  for  finding  the  smallest  orthogonal 
design  involving  partial  duplication .  Rather  than  requiring  that 
be  greater  than  or  equal  to  one,  we  require  that  be  greater  than 
or  equal  to  two.  Consider  again  a  2  experimental  situation  as  in  the 
numerical  example  of  Section  3*  The  constraint  matrix  A  remains 
unchanged.  Since  the  smallest  orthogonal  design  has  1  equal  to  2, 
we  may  start  with  the  constraint  X-X*  =3  2 .  The  vector  b  has  its 
first  two  components  equal  to  2  and  the  remaining  components  equal  to 
zero.  The  minimum  basic  feasible  solution  to  this  revised  problem  is 
twice  that  vhich  was  obtained  in  the  example;  that  is, 

1 

w  «  (3,  1,  0,  2,  0,  0,  1,  0,  1,  1,  1,  0,  1,  1,  1,  1,  1,  1,  0)  .  Among 
k 

incomplete  2  designs  of  resolution  3>  the  corresponding  design  is 
therefore  the  smallest  which  is  orthogonal  and  involves  partial  dupli¬ 
cation  .  A  single  duplicated  point  would  ordinarily  not  be  enough,  and 
since  the  given  design  is  a  unique  minimum,  a  larger  design  is  required. 
In  sixteen  runs  the  half -replicate  can  be  repeated,  so  that  in  prac¬ 
tical  situations  one  would  either  use  this  design  or  a  non-orthogonal 
design  such  as  one  of  those  given  by  Dykstra. 
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APPENDIX 


In  Section  3  it  was  noted  that  the  set  of  all  possible  constraints 
implied  by  Theorem  4  is  linearly  dependent.  In  this  Appendix  are  pre¬ 
sented  two  methods  for  arriving  at  equivalent  sets  of  linearly  inde¬ 
pendent  constraints  for  orthogonal  factorial  designs.  These  will  be 
called  the  base -set  method  and  the  crossing-out  method.  The  base -set 
method  consists  of  finding  a  base  set  of  the  w.  and  X,  which  is  any 
largest  linearly  independent  subset  of  the  collection  of  w^  and  X. 
Once  a  base  set  is  found,  all  the  constraints  can  be  written  down  as 
expressions  for  w^  not  in  the  base  set.  The  crossing-out  method 
involves  writing  down  all  possible  constraints  and  then  systematically 
crossing  out  those  which  are  linear  combinations  of  previous  ones. 

This  Appendix  contains  details  of  the  two  techniques,  examples  of 
their  use,  and  a  proof  of  their  validity.  The  proof  will  proceed  by 
showing  that  the  constraints  arrived  at  by  using  the  base -set  method  are 
linearly  independent,  that  the  constraints  arrived  at  by  using  the 
crossing-out  method  are  equivalent  to  the  set  of  all  constraints,  and 
finally  that  the  same  number  of  constraints  is  obtained  by  using  either 
method. 
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The  base -set  method 


Once  again  let  represent  the  number  of  occurrences  of  that 
treatment  combination  whose  vector  representation  is  the  vector  j . 

That  level  of  a  factor  which  is  designated  by  the  largest  number  will 
be  called  the  "highest  level"  for  that  factor.  We  will  see  that  the 
set  of  Wj  for  which  j  has  n-t-1  or  fewer  components  at  their 
highest  levels  is  a  base  set  for  an  orthogonal  resolution  t+1  design 
for  nit  factors .  Obviously  this  is  but  one  of  a  large  number  of 
base  sets.  If  n  equals  t,  then  every  w^  is  equal  to  X,  so  that 
the  base  set  consists  of  Just  this  element. 

For  convenience,  the  alleged  base  set  will  be  referred  to  as  the 
base  set,  even  though  this  will  not  be  established  until  later.  If  it 
can  be  shown  that  each  w^  not  in  the  base  set  can  be  expressed  as  a 
linear  combination  of  variables  in  the  base  set,  then  it  follows  that 
the  set  of  constraints  generated  in  this  way  is  linearly  independent, 
since  each  involves  a  unique  variable  not  appearing  in  any  other  con¬ 
straint.  It  is  enough  to  show  that  an  expression  for  each  w^  not  in 

the  base  set  can  be  derived  in  terms  of  X  and  w  with  fewer  compo- 

J 

nents  of  J  at  their  highest  levels . 


Let  Wj  be  any  w^  such  that  J  has  exactly  n-t+h  components 
at  their  highest  levels,  where  h  is  an  integer  between  0  and  t. 
Then  Wj  is  not  in  the  base  6et.  For  convenience,  let  these  components 
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be  the  last  n-t+h .  By  Theorem  4,  the  number  of  times  the  first  t 
factors  occur  together  at  the  levels  appearing  as  the  first  t  compo¬ 
nents  of  J  is  a  known  multiple  M  of  X.  (The  constant  M  depends 
on  the  numbers  of  levels  of  the  n  factors  and  on  which  are  at  their 
highest  levels  in  J.)  Therefore,  MX  equals  the  sum  of  all  those  w^ 
for  which  j  is  identical  with  J  in  the  first  t  components .  This 

constraint  can  be  rewritten  as  w_  equals  MX  minus  a  sum  of  w  for 

«  J 

which  j  has  n-t+h-1  or  fewer  components  at  their  highest  levels. 
Therefore,  as  asserted,  any  w^  not  in  the  base  set  can  be  expressed 
as  a  linear  combination  of  X  and  w^  with  fewer  components  at  their 
highest  levels  . 


To  Illustrate  we  will  construct  a  set  of  linearly  independent 

4 

constraints  for  the  resolution-^*  2  design,  as  in  the  example  of  Section 

3.  A  base  set  is  (VQ000J  *0001'  W0Q10,  V01QO'  W10G0' 
the  first  two  cooponents  of  j  at  0  and  0,  we  see  that 

w0000  +  w0001  +  w0010  +  WOOU  *  *■>  wMcb  yieMs  “1  expression  for 
w0011  ^  ^erms  of  the  base  set .  The  following  table  lists 

in  schematic  form  expressions  for  the  w  not  in  the  base  set  in  terms 

J 

of  variables  whose  subscripts  have  fewer  components  at  their  high 

levels.  The  subscripts  J  are  used  for  the  variables  w  .  The  third 

J 

column  lists  the  ordinal  numbers  of  the  components  which  are  held  fixed 
n  deriving  the  given  expressions. 
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Variable  Expression  Fixed  C proponents 


0011 

X 

- 

0000 

mm 

0001 

- 

0010 

1 

and 

2 

0101 

X 

- 

0000 

- 

0001 

- 

0100 

1 

and 

3 

0110 

X 

- 

0000 

- 

0010 

- 

0100 

1 

and 

4 

0111 

X 

- 

0100 

- 

0101 

- 

0110 

1 

and 

2 

1001 

X 

- 

0000 

- 

0001 

mm 

1000 

2 

and 

3 

1010 

X 

mm 

0000 

- 

0010 

- 

1000 

2 

and 

4 

ion 

X 

- 

1000 

mm 

1001 

- 

1010 

1 

and 

2 

1100 

X 

mm 

0000 

- 

0100 

- 

1000 

3 

and 

4 

1101 

X 

- 

1000 

- 

1001 

- 

1100 

1 

and 

3 

mo 

X 

mm 

1000 

- 

1010 

mm 

1100 

1 

and 

4 

nil 

X 

— 

1100 

- 

1101 

• 

mo 

1 

and 

2 

This  same  set  of  constraints,  written  In  matrix  form,  is  used  in  the 
example . 


The  number  of  constraints  using  the  base-set  method  is  the  same  as 


the  number  of  v  not  in  the  base  set,  and  this  in  turn  is  the  number 

v 

of  treatment  combinations  with  n-t  or  more  components  at  their  high¬ 
est  levels .  Let  t  equal  2 .  The  number  of  treatment  combinations 
with  n  components  at  their  highest  levels  is  1,  the  number  with 
n-1  components  at  their  highest  levels  is  the 

number  with  n-2  components  at  their  highest  levels  is 


j?  ,  tf 


(s.  -l) (s.  -l) .  The  total  is  therefore 


h*  W1  ' 

1  +  2(s4  -l)  +  S(s4  -l)(s.  -l)  .  If  t  equals  3,  the  total  number 

n.  n.  \ 

of  constraints  is  l+E^  -l)+E(s1^-l)(si^-l)+E(si^-l)(si^-l)(si  -l), 
where  the  range  of  summation  in  the  last  term  is  lii^<i2<i^cn. 
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The  general  expression  can  "be  written  down  in  the  form 
1  +  Eg  Il^=1  -l),  where  the  set  0^  is  the  set  of  i^  such 

that  1  £  i^  <  ig  <  •••  <i^£n. 

The  crossing-out  method 

The  crossing-out  method  is  a  technique  for  reducing  the  set  of  all 
constraints  by  eliminating  those  which  are  obvious  linear  combinations 
of  others .  The  set  of  all  constraints  contains  statements  about  the 
number  of  occurrences  of  combinations  of  levels  of  subsets  of  t 
factors .  By  adding  together  sets  of  constraints,  statements  can  be 
derived  about  the  number  of  occurrences  of  combinations  of  levels  of 
t-1  factors .  Because  a  given  subset  of  t-1  factors  is  contained  in 
several  subsets  of  t  factors,  such  statements  are  not  unique . 

Indeed,  it  is  because  of  this  that  the  set  of  all  constraints  is  lin¬ 
early  dependent. 

The  crossing-out  technique  will  first  be  described  for  t  «  2 . 
Consider  the  first  two  factors,  which  are  at  s^  and  s 2  levels, 
respectively.  A  set  of  constraints  can  be  written  down  from  the 

requirement  that  the  number  of  occurrences  of  each  of  the  s^s2 
possible  combinations  of  levels  of  the  first  two  factors  must  be  equal 
to  each  other  and  to  a  known  multiple  of  the  index  X.  Let  the 

first  two  components  of  j  be  represented  by  u  and  v,  and  suppose 
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A  DIVISION  OF  NORTH  AMERICAN  AVIATION.  INC. 


rNE  • 


that  the  constraints  are  written  down  in  increasing  numerical  order  of 

equals  the  sum  of  all  w^  for 

which  the  first  two  components  of  j  are  u  and  v  has  ordinal 

number  s„u+v+l  in  the  constraint  array*  If  the  first  s_  relat ion- 

ships  are  added  together,  the  result  is  that  the  number  of  occurrences 
of  level  0  of  the  first  fee  tor  is  equal  to  SgM^X .  By  adding  any  of 

s,  sets  of  s„  consecutive  constraints,  the  number  of  occurrences  of 

any  level  of  the  first  factor  is  SgM^  also.  Similarly,  the  number 
of  occurrences  of  any  level  of  the  second  factor  equals  s^M^X* 


uv.  That  is,  the  constraint  that  KjX 


Considering  now  the  first  and  third  factors,  we  may  write  down 
b-jS^  linear  constraints .  Suppose  they  are  again  arrayed  in  numerical 
order,  so  that  the  constraint  {MgX  equals  the  sum  of  all  w^  for 
which  J  has  u  and  v  as  its  first  and  third  ccnpcnents }  has 
ordinal  nu&ber  b,ukv+1*  By  adding  the  first  s  relationships  we 
find  that  the  number  of  occurrences  of  level  0  of  the  first  factor 
is  s^MgX.  Similarly,  by  considering  each  of  the  sets  of  s^ 
consecutive  relationships,  we  find  that  the  number  of  occurrences  of 
any  level  of  the  first  factor  is  sy^X.  Therefore,  Mg  equals 
(b^/eOK,  ,  and  what  is  more  Important,  ther<;  are  obvious  linear 
dependencies  between  relationships  written  down  by  considering  the 
first  and  second  factors  and  those  written  down  by  considering  the 
first  and  third  factors.  If  every  s^-th  relationship  is  removed  or 
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"crossed  out"  from  the  set  of  B±Bj  relationships  formed  by  consider¬ 
ing  the  first  and  third  factors,  then  these  obvious  dependencies  vill 
be  removed • 

Turning  attention  next  to  the  second  and  third  factors,  one  can 
write  down  s„s,  relationships.  The  first,  second,  •••  ,  s9s  -th 
constraints  are  that  M^X.  is  eq.ual  to  the  sum  of  those  w^  for  which 
the  second  and  third  components  of  j  are,  respectively,  0  and  0, 

0  and  1,  •  *  *  ,  s^  and  .  Again,  by  adding  sets  of  relationships, 
expressions  can  be  obtained  for  the  number  of  occurrences  of  each  of 
the  levels  of  the  second  factor  and  similarly  for  the  third  factor. 

But  expressions  are  already  available  for  each  of  these  numbers  of 
occurrences;  those  for  each  level  of  the  second  factor  are  obtainable 
from  the  first  group  of  s^s^  relationships,  and  those  for  the  number 
of  occurrences  of  each  level  of  the  third  factor  are  obtainable  from 
the  second  group  of  relationships .  Suppose  now  that  every 

s^-th  relationship  is  crossed  out  from  the  set  of  SgS^  relationships 
formed  by  considering  the  second  and  third  factors,  and  also  the  last 
s^  relationships  are  crossed  out .  Then  in  all  s2+s ^-1  constraints, 
and  also  all  obvious  dependencies,  will  be  removed . 

Thus  far  only  three  factors  have  been  considered,  but  the  above 
results  can  easily  be  extended  to  the  remaining  factors  .  From  the  set 
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of  linear  relationships  obtained  by  considering  the  first  and 

i-th  factors,  every  s^-th  relationship  may  be  crossed  out  in  order  to 
eliminate  the  obvious  ncn-uniqueness  of  expressions  for  the  number  of 
occurrences  of  each  level  of  the  first  factor.  From  the  set  of  s.s. 

i  j 

relationships  obtained  by  considering  the  i-th  and  j-th  factors 
(l  <  i  <  j),  every  s^-th  relationship  and  the  last  relationships 

may  be  crossed  out,  since  expressions  are  already  available  for  the 
number  of  occurrences  of  each  level  of  both  the  i-th  and  j-th  factors . 

Before  systematizing  the  crossing-out  method  for  t  £  2,  we  will 
show  that  the  number  of  constraints  remaining  after  using  the  crossing¬ 
out  procedure  is  the  same  as  that  obtained  from  the  base  set  in  the  case 
t  «  2 .  There  are  s^s,^  relationships  remaining  in  the  first  group, 
none  having  been  removed.  In  the  group  obtained  by  considering  the 
first  and  i-tL  factors  there  are  s^s^-s^  constraints  remaining. 

In  the  group  obtained  by  considering  the  i-th  and  j-th  factors  there 
are  s^Sj-s^-s^+l  constraints  remaining.  The  total  may  be  expressed 

88  els2  +  ^3  ‘‘iK"1)  +  *>1*1  (Bi'lKsj-1)  •  We  have  seen  that 

the  number  of  constraints  using  the  base-set  method  is 

A  +  ^1  W"1)  +  ^Ll  ^i+l 

*  1  +  £^2.  ^i"1^  +  ^i.2  ^=1+1  ^Bi"1^sj”l)  +  ^l”1^  ^i”1^ 

-  1  +  a^l+Sj^  {8j-l)  +  E  E  (si'-l)(ej-l) 

*  s^Sg  +  ®2.  S  S  (s^-l)(Sj-l)  • 
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Therefore,  the  methods  yield  equivalent  sets  of  linearly  independent 
constraints . 


Formalization  of  the  crossing-out  method 

We  first  formalize  the  procedure  Just  described  for  t  »  2 .  Con¬ 
sider  each  pair  of  factors  in  the  standard  (alphabetic)  order 
BA,  CA,  & ,  12A,  DB,  DC,  EA,  •  •  •  ,  where  A,  B,  C,  •••  are  the  first, 
second,  third,  *  *  ♦  factors .  We  will  say  that  a  pair  of  factors  is  in 
class  £2  if  exactly  £2  of  the  factors  have  occurred  at  least  once  in 
a  previous  pair.  Thus,  the  pair  BA  is  the  only  class  0  pair,  the 
remaining  pairs  containing  A  are  class  1  pairs,  and  the  rest  are 
class  2  pairs.  Given  the  i-th  and  J-th  factors,  consider  the  con¬ 
straints  in  the  order  obtained  by  looking  at  the  pairs  of  levels  in 

increasing  numerical  order  00,  01,  •••  ,  Os.,  10,  •••  ,  Is.,  •**  ,  s.O, 

j  J  * 

•  •  •  ,  s^Sj .  Then  if  the  pair  is  in  class  0,  cross  out  no  constraints; 

if  the  pair  is  in  class  1  cross  out  every  s.-th  constraint;  if  the 

J 

pair  is  in  class  2  cross  out  every  s^-th  constraint  and  the  last 
Sj  constraints . 


Now  suppose  t  is  equal  to  3«  A  triple  of  factors  is  in  class 
&  if  £2  of  the  3  pairs  have  occurred  in  a  previous  triple.  If  the 
triple  consisting  of  the  i-th,  J-th,  and  k-th  factors  is  in  class  £2 
and  if  fl  £  1,  cross  out  every  s^-th  constraint.  If  £2  i  2,  cross 
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out  every  s^-th  group  of  s^  constraints.  If  ft  »  3,  cross  out  the 


last  sjsk  constraints. 


In  general,  a  t-tuple  of  factors  is  in  class  ft  if  ft  of  the 

(t-l) -tuples  have  occurred  in  previous  t-tuples  when  written  in 

standard  alphabetic  order.  If  the  t-tuple  consisting  of  the  i^-th 

through  i^-th  factors  is  in  class  ft  and  if  ft  ^  then  cross  out 

the  last  II,  ,  _  s.  relationships  in  each  set  of  II.  .  ,  s. 

j=t-a>f2  i^  *  J»t«a*l  ij 

consecutive  relationships . 


There  are  (£)  t-tuples,  and  the  number  of  these  in  class  ft  is 
.  The  factors  in  a  t-tuple  in  class  ft  are  the  first 
through  (t-ft)-th  together  with  ft  of  the  factors  numbered  t-ft+2 
through  n.  The  number  of  constraints  left  after  crossing  out  in  a 
set  of  constraints  formed  by  considering  the  t-tuple  consisting  of 
factors  whose  ordinal  numbers  are  1,  •  •  *  ,  t-ft,  i^,  •  •  •  ,  ifl,  is 

Bl}  x  (si  -x)}.  The  total  number  of  constraints  for  t  =  2 

is  given  by  + 

and  for  t  =  3  ty 

W}  *  SlB2(ei1-1)  +  ^3^2-V1  Sl(ei1'l)(Si2'l) 

+  ^=2  *VV1  . 

The  general  expression  can  be  written  in  the  form 
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nLi  ai + ■* 


,t-k 


Bin\  „  where  the  set  0^  ^  is 

fc*  t  ,3  * 

the  set  of  indices  i^,  •••  ,  i^  such  that  t-k+1  <  i^  <  i^  <  •  •  • 

<l^in.  The  general  expression  derived  above  for  the  base -set 

method  is 

1  +  4l  \  ^-1  <S1  -«>  vhere  \  -  f^jf1  <  *1  <  S  <  <  **  *  n}‘ 

These  two  expressions  will  now  be  shown  to  be  equivalent. 


THEOREM  7*  For  all  t,  the  number  of  constraints  using  the  base- 
set  method  is  equal  to  the  number  of  constraints  using  the  crossing-out 
method. 


PROOF.  For  fixed  but  arbitrary  n  and  s^,  s2,  •••  ,  s^,  let 
g(t)  be  tie  number  of  constraints  using  the  base -set  method  for  a 
design  of  strength  t  and  let  h(t)  be  the  number  of  constraints 
using  the  crossing -out  method.  Ve  have  already  shown  that  g(z) 
equals  h{2)  .  We  will  show  that  for  all  t,  g(t)  -  g(t-l)  equals 
h(t)  -  h(t-l),  which,  by  Induction,  will  complete  the  proof.  The 
difference  g(t)  -  g(t-l)  is  given  by  2^  H^^Cs.^  -l),  where 

&t  m  {i  |l  s  <  ig  <  ...  <  i^  s  n}.  The  difference  h(t)  -  h(t-l) 
is  given  by 

nLi  8i +  b^\  A*  <8i -bJ  -  8i 

riLi  (*<  -i)}-  The  latter  may  be  rewritten 
j-j.  ij 


"  ^ni»i 1  8i^  &e'k  t  l  *>1 
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h(t)  -  h(t-l)  -  +  tig,  (.±  -X)  ),  where 

\  ■  (^1  *i^Ee'  (»i  *  ('£.1*  lfl±) ^i(si  • 

*t  *•  +  *>  J 

The  tern  Dq  reduces  to  s^(s^.-l)  .  Vhen  this  is  added  to  the 

first  term  of  the  result  is 

(££  »i){^? °  fii^Se*  CJol^si  “1^*  Subtracting 

"1  jL  1,  t— 1  j 

the  second  term  of  Dl^  ve  have 

D0  +  DL-  8i^8t-l“1^^0*  n^1(si  -2.J  } .  Adding  the  first  tern 

i  o  ij  t  “1  „  j 

of  D2  ve  obtain  6j[) fegi  from  vhich  is 

derived  I**  \  *  <n£l  sl>(V2-l){^*  +  AlK,  -l)  } .  Proceeding 

2 , t-1  j 

analogously,  the  summation  of  the  becomes 

Iiio  \  *  ^  iD5“^Si  ~1^-  We  wlJJL  now  add  0118 

to  the  final  term  of  h(t)  -  h(t-l),  namely  {£«,  ,  (s  -l)}.  The 

®t,t  ^mX  XJ 

index  sets  are  ^  «  {l  <  i^  <  1^  <  •••<  itl  <  n}  and 

•It  “  f1  <  *1  <  *z  <  •  •  •  <  i^  in}.  The  result  of  the  addition  is 

therefore  “UK,  -l),  where  is  the  set 

t  J 

{l  i  i^  <  ig  <  •••  <  i^  i  n},  vhich  is  equal  to  g(t)  -  g(t-l),  as 
vas  to  be  proved. 


Therefore,  the  base-set  method  and  the  crossing -put  method  are 
equivalent.  Although  the  base-set  method  is  used  for  the  example  in 
Section  3*  the  methods  are  equally  easy  to  apply,  and  either  may  be 
used  to  construct  a  set  of  linearly  independent  constraints  for  the 
application  of  linear  programming  to  the  derivation  of  designs . 
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APPENDIX  B 


CHARACTERIZATION  OF  NON-ORTHOCOXAL  INCOMPLETE  FACTORIAL  DS3IGK3 

SUMMARY 

Two  general  classes  of  non-orthogonal  incomplete  factorial  designs 
called  clumpwise-orthogonal  designs  and  permutation- invariant  designs 
are  defined.  The  cross-product  matrices  of  the  former  can  be  arranged 
to  contain  blocks  of  non-zero  elements  down  the  main  diagonal  and  zeros 
elsewhere.  The  latter  class  may  be  described  as  containing  designs  in 
which  those  factors  which  appear  at  the  same  number  of  levels  are  treated 
alike. 

A  review  of  the  existing  literature  on  non-orthogonal  designs  shows 
that  despite  the  large  quantity  and  variety  in  methods  for  construction, 
the  designs  all  fall  into  one  or  both  of  the  classes  defined  above. 

It  is  shown  that  both  classes  of  designs  possess  characterizations 
as  integer  solutions  to  sets  of  linear  constraints.  For  a  subclass 
called  special  clumpwise-orthogonal  designs,  defined  only  when  all  fac¬ 
tors  are  at  two  levels,  a  different  characterization  is  derived.  This 
characterization  involves  group-theoretic  considerations  similar  to 
those  used  in  the  classical  theory  of  fractional  factorials. 

1.  INTRODUCTION 

Orthogonal  incomplete  factorial  designs  have  been  used  for  many 
years.  In  many  experimental  situations,  however,  the  smallest  orthogonal 
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incomplete  design  may  entail  more  runs  than  the  experimenter  is  prepared 
to  make.  Recently  there  has  been  increased  interest  in  non-orthogonal 
designs,  on  vhich  there  is  no  a  priori  restriction  on  the  number  of  runs, 
and  many  methods  have  been  set  forth  for  constructing  such  designs. 

Criteria  for  comparing  designs 

Unlike  orthogonal  designs,  vhich  as  a  class  enjoy  certain  optimality 
properties,  non-orthogonal  designs  must  be  judged  individually.  First 
of  all,  the  design  must  have  at  least  the  required  resolution,  vhich  is 
defined  as  follovs  (Box  ft  Hunter,  1961) t  If  '•  is  even  and  a  design  is 
of  resolution  t+1  then  all  parameters  involving  -£-t  or  fever  factors 
are  estimable,  ignoring  parameters  involving  more  factors.  If  t  is 
odd,  parameters  involving  (t-l)/2  factors  are  estimable,  ignoring 
parameters  involving  (t+3)/2  or  more  factors.  Those  involving  (t+l)/2 
factors  are  neither  estimable  nor  comple tely  ignored. 

Ve  vill  suppose  that  a  particular  parameterization  has  been  decided 
upon.  (See  Vebb  (1963)  for  a  discussion  of  parameterizations  for  fac¬ 
torial  designs.)  Given  two  designs  vhich  have  the  same  resolution  and 
number  of  runs,  the  one  vhich  permits  estimation  with  smaller  variances 
vill,  generally  speaking,  be  more  desirable.  More  specifically,  four 
possible  criteria  for  optimality  of  a  design  ares 
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that  all  the  variations  are  sinlaizcd, 


i) 

ii)  that  the  trace  of  the  covariance  matrix  is  minimized, 
ill)  that  the  determinant  of  the  covariance  matrix  ie  minimized,  or 
iv)  that  the  average  variance  of  a  predicted  value  ie  minimised* 

The  first  criterion  is  preferable,  but  often  it  is  impossible  to  satisfy 
it  with  a  design  utilising  a  given  number  of  runs  and  a  parameterization 
specified  a  priori.  The  second  is  equivalent  to  minimizing  the  sum  of 
the  variances  of  the  estimates,  and  the  third  is  equivalent  to  minimising 
the  volume  of  a  confidence  ellipsoid  on  the  parameters  (Mood,  1946).  If 
the  parameterisation  is  scaled  in  an  appropriate  way,  criterion  (iv)  is 
equivalent  to  (ii);  more  generally  (iv)  will  reduce  to  a  weighted  average 
of  the  variances  of  the  estimates* 

Plackett  and  Burman  (1946)  showed  that,  subject  to  the  restriction 
that  the  lengths  of  the  columns  of  the  design  matrix  be  fixed,  orthogonal 
designs  satisfy  criterion  (i),  hence  also  (ii),  and  (iv)*  Without  this 
restriction,  however,  it  is  often  possible,  depending  on  the  parameteri¬ 
zation,  to  construct  non-orthogonal  designs  which  surpass  orthogonal 
ones  with  the  same  number  of  runs  under  any  of  the  last  three  criteria* 

In  another  paper  (Webb,  1964),  it  was  shown  that,  if  one  uses  a  main-effect 
parameterization  under  which  the  full  factorial  is  orthogonal,  criteria 
(iii)  and  (iv)  are  satisfied  if  and  only  if  the  design  is  also  orthogonal* 

v. 
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Yhereas  knowledge  of  the  conditions  under  which  orthogonal  designs 
are  optimum  is  as  yet  incomplete,  knowledge  of  suoh  conditions  for  non- 
orthogonal  designs  is  for  the  most  part  nonexistent.  The  few  isolated 
cases  in  which  any  optimality  property  has  been  demonstrated  for  a  non- 
orthogonal  design  are  indicated  in  subsequent  sections. 

Classification  of  non-orthogonal  designs 

The  non-orthogonal  designs  in  the  literature  fall  into  two  over¬ 
lapping  classes  wfc  ch  I  call  nermutation-lnvariant  and  cluarowiae- 
orthogonsi  designs.  ▲  permutation-invariant  design  involving  factors 
at  the  same  number  s  of  levels  is  one  for  which  if  abc...n  is  any 
treatment  combination  appearing  in  the  design,  where  the  symbols 
a,  b,  ...»  n  take  on  one  of  the  values  0,  1,  ...»  s-1,  then  all  the 
treatment  combinations  which  are  permutations  of  the  given  treatment  com¬ 
bination  also  appear  in  the  design.  For  the  case  in  which  all  factors 
do  not  appear  at  the  same  number  of  levels,  a  permutation-invariant 
design  is  one  which  contains  the  same  treatment  combinations  if  factors 
appearing  at  the  samo  number  of  levels  are  permuted.  Later  we  will  con- 
sider  permutation-invariant  designs  of  strength  t.  which  are  more 
general  designs  for  which  the  cross-product  matrix  is  not  altered  if 
factors  appearing  at  the  same  numbers  of  levels  are  permuted.  The 
parameter  t  is  twice  the  number  of  factors  in  the  highest-order  inter¬ 
action  to  be  estimated. 


A  clumpwifc»-cithogonal  design  is  on®  for  which,  by  rearranging  the 
columns  of  the  coefficient  matrix  X,  the  cross-product  matrix  can  b® 
arranged  so  that  ther®  are  square  suboatrices  of  non-zero  elements  down 
the  main  diagonal  and  zeros  elsewhere*  A  subclass  of  particular  interest, 
defined  for  the  case  whan  all  factors  appear  at  two  levels,  consist®  of 
the  special  clmnwiae>*orthogonal  designs,  which  have  all  non-zero  off- 
diagonal  elements  equal.  I  will  restrict  the  definition  of  clumpwise- 
orthogon&l  designs  by  requiring  that  there  be  at  least  two  clumps,  since 
otherwise  every  design  would  satisfy  the  definition.  It  is  convenient, 
how ever-  not  to  impose  this  restriction  on  the  class  of  special  cluap- 
wlse-orthogonal  designs. 

Both  the  permutation-invariant  and  clumpwise^rihogonal  classes 
contain  orthogonal  designs.  A  simple  example  of  &  design  which  is  not 
orthogonal  but  which  is  still  contained  in  both  classes  is  the  design 
for  two-level  factors  consisting  of  runs  at  treatment  combinations  00, 

00,  01,  10,  11,  and  11. 

2.  SURVEY  OF  lON-OmOOOKAL  SSSICM3 
It  is  felt  that  this  literature  survey  is  reasonably  exhaustive 
with  the  exception  of  quite  recent  work  in  the  area.  The  non-orthogen&l 
designs  covered  have  been  derived  using  many  different  devices  and  from 
widely  different  points  of  view.  Still,  they  all  may  be  categorised 
into  the  permutation-invariant  end  oluapwiae-crthogonal  classes. 
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Early  non~crthogonal  designs 


Perhaps  the  earliest  non-orthogonal  designs  were  the  optimum  weigh¬ 
ing  designs  proposed  by  Mood  (1946).  Suppose  p  objects  are  to  be 
weighed  on  a  single-pan  scale.  If  each  object  is  weighed  individually 
the  variance  of  each  estimated  weight  is  the  error  variance  associated 
with  each  weighing.  By  weighing  the  objects  in  appropriately  chosen 
groups,  the  variance  of  the  estimated  weights  may  be  decreased.  Using 
criterion  (iii)  (the  determinant)  to  judge  optimality.  Hood  showed  that 
if  pm  2k-l,  then  the  design  consisting  of  all  combinations  of  the  p 
objects  taken  k  at  a  time  is  optimum.  If  p  =*  2k,  he  showed  that  the 
design  which  consists  of  all  combinations  taken  k  at  a  time  together 
with  all  those  taken  k  +  1  at  a  time  is  optimum.  Banerjee  (1948) 
pointed  out  that  optimality  is  preserved  if  only  those  combinations 
which  comprise  a  balanced  incomplete  block  design  are  weighed.  The 
designs  of  Hood  and  Banerjee  are  both  permutation-invariant  and  special 
clumpwise-orthogonal  with  one  clump. 

Chakravarti  (1956)  introduced  the  "partially  balanced  array  of 
strength  t"  vhich  is  a  specialization  of  the  permutation-invariant 
design  of  strength  t  to  the  case  where  all  factors  are  at  the  same  num¬ 
ber  of  levels.  I  have  chosen  not  to  adopt  his  nomenclature  because  I 
believe  "permutation- invariant"  is  a  better  description  of  the  defining 
property  of  these  designs.  He  gives  two  simple  examples  of  such  designs. 
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on*  containing  runs  at  treatment  combination*  0000,  0111,  1011,  1101, 

4  5 

and  1110  for  a  2  experiment,  and  another  for  a  2  ,  containing  runs 

at  00000  and  all  treataent  combinations  with  four  factors  at  th*ir  high 
level* 

Morrison  (1956)  presented  an  interesting  technique  for  constructing 

incomplete  factorial  designs  in  the  case  where  all  factors  are  not  at 

the  sane  number  of  levels*  It  is  moat  effective  when  all  but  one  of 

them  are  at two  levels*  The  levels  for  the  many-level  factor  are  grouped 

into  pairs  (with  one  level  left  over  if  the  number  of  levels  is  odd)* 

A  standard  fractional  factorial  is  constructed  using,  in  turn,  each  pair 

with  all  of  the  two-level  factors*  To  illustrate,  consider  an  incom- 
4  1 

plete  2  5  design,  and  designate  the  levels  of  the  five-level  factor 
by  the  integers  0  through  4*  The  design  nay  be  constructed  in  three 
stages*  First,  identifying  levels  0  and  1  of  the  five-level  factor 
with  the  levels  of  an  (imaginary)  fifth  two-level  factor  E,  an  eight- 

5 

replicate  of  a  2  is  constructed  using  as  generators  in  the  defining 

contrast  AB,  CD,  and  ACE*  (For  definitions  see  Daniel  (1956)*) 

Kext,  levels  2  and  3  are  associated  with  3  and  another  eighth-replicate 
is  constructed  using  -A3,  -CD,  and  ACE  to  generate  the  defining 

4 

contrast*  Finally,  an  eighth-replicate  of  the  2  is  constructed, 
using  AD,  AC,  and  AD  to  generate  the  defining  contrast,  and  level  4 
of  the  five-level  faotor  ie  used  for  these  runs.  The  complete  design 
is  as  follows: 
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0  0  0  0  0 
11110 
0  0  111 
110  0  1 
0  10  12 
10  10  2 
0  110  3 
10  0  13 
0  0  0  0  4 
11114 

Morrison 'b  method  of  construction  leads  to  clumpwlse-orthogonal  designs. 
Irregular  fractional  factorials 

i  type  of  non-orthogonal  design  which  has  received  a  great  deal  of 
attention  is  the  irregular  fractional  factorial.  The  general  form  of 
such  an  incomplete  factorial  is  the  k/s*  fraction  of  the  sn  design. 
Saner jee  (1949)  discussed  (2m~l)/2tt  fractions  of  2n  designs  in  con¬ 
nection  with  the  weighing  problem.  Further  examples  were  given  by 
Kemp thorns  (1952)  and  by  John  (l96l).  Addelman  (l96l)  gave  a  rather 
detailed  analysis  of  such  designs  and  presented  a  catalogue  of  3/2® 
replicates  of  2n  designs.  Dykstra  (1959)  gave  a  catalogue  of  irregular 
fractional  factorials  which  involve  partial  duplication.  All  irregular 
fractional  factorials  are  clumpw  is  ©-orthogonal ,  and  those  for  the  2n 
case  are  special  olumpwise-crthogonal • 

Recent  methods  for  2*3a  experiments 

Several  workers  have  developed  techniques  for  deriving  incomplete 
3®  and  2D3n  designs »  all  of  which  lead  to  designs  which  are  both 
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permutation-invariant  and  clumpwise-orthogonal .  DeBaun  (1959)  developed 
a  series  of  response-surface  designs  for  thrao  indopandant  variables  which 
ara  raa trie tad  to  *ee  lavalu,  so  that  tha  designs  are  also  incomplete 
factorials.  The  Method  is  to  combine  tha  following  subdesigns:  the  center 
point  (treatment  combination  111),  the  octahedron  (Oil,  211,  101,  121, 

110,  and  112),  the  "cuboc tahedron"  (all  treatment  combinations  with  one 
l)  and  the  cube  (all  treatment  combinations  with  no  l).  DeBaun  gave 
analyses  for  designs  consisting  of  various  combination  of  these  subdesigns 
with  repetition  of  entire  subdesigns  permitted,  and  he  compared  their 
efficiencies  and  variance  contours  as  second-order  response-surface 
designs* 

Box  and  Behnken  (i960) ,  who,  like  DeBaun,  were  motivated  by  response- 
surface  considerations,  constructed  a  series  of  incomplete  factorials 
utilising  balanced  incomplete  block  configurations.  The  f motors  are 
associated  with  the  block  elements  (often  called  “varieties"  in  the 
literature  on  balanced  incomplete  blocks).  For  each  block  a  subdesign 
is  constructed  containing  a  two-level  design  using  levels  0  and  2  of  the 
factors  appearing  in  the  block,  with  all  factors  not  in  the  block  being 
held  at  level  1.  After  the  procedure  is  repeated  for  every  block  the 
complete  design  is  formed  by  combining  the  subdesigns  and  then  appending 
several  runs  at  the  center  point. 


37 


Connor  (i960)  developed  a  technique  for  handling  2*3°  designs  in 
which  several  conbinationa  of  standard  fractions  of  the  2*  and  of  the 
3&  are  combined  in  a  specified  Banner.  One  of  his  examples  is  the 
following  incomplete  2y3  design.  Let  the  two  half- replica tea  of  the 

3 

2  be  denoted  symbolically  by  3^  and  3^  and  the  three  third- 

2 

replicates  of  the  3  be  denoted  by  T^,  I2,  and  T^.  Let  S^T^  be 
the  design  formed  by  taking  all  possible  combinations  of  treatment  com¬ 
binations  in  3 
is  a  half  of  the  full  factorial. 

Another  method  for  forming  incomplete  factorial  designs  for  3° 

and  2a3n  experiments  was  given  by  Fry  (l96l).  The  full  factorial 

consists  of  a  center  point  (the  treatment  combination  11... l)  surrounded 

by  concentric  hyperspheres ,  the  r-th  containing  the  (&)2r  treatment 

r 

combinations  wit!:  exactly  r  non-i  components.  A  design  may  be  con¬ 
structed  by  combining  fractions  of  the  treatment  combinations  on  each 
hypersphere.  Fry  gives  an  example  of  an  incomplete  3*  design  formed 
by  taking  all  the  points  in  every  other  hypersphere  and  omitting  entirely 

the  others  (this  is  essentially  BeBaun's  method).  Since  the  (n)2r 

r 

points  in  the  r-th  hypersphere  may  be  considered  formally  as  (n) 

r 

separate  faotori/J.8  in  r  factors  at  two  levels*  standard  fractions  may 
he  taken  from  each.  Also*  if  the  design  contains  m  factors  at  two 
levels  as  well  as  the  n  three-level  factors,  fractions  of  the  2r*at 


with  those  in  T^.  Then  the  design 
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design  may  be  used.  Pry  gives  as  an  example  a  "half-replicate*  of  a 

15  2 

2  3  design  constructed  in  this  way.  He  points  out  that  this  method  of 
construction  leads  to  designs  which  are  quite  similar  to  those  derived 
by  Connor. 

It  is  interesting  to  note  that  DeBaun,  Box  and  Behnken,  Conn or,  and 
fry  used  disparate  methods  and  were  motivated  by  contrasting  considera¬ 
tions.  Tet  the  designs  obtained  are  all  permutation-invariant  and 
olumpwlse-orthogonal,  and  moreover  usually  possess  a  similar  clump 
structure. 

Expansible  contractible  designs 

In  work  which  has  not  previously  been  reported,  £•  V.  Last1  developed 
a  series  of  permutation-invariant  designs  for  factors  at  two  levels.  He 
was  motivated  by  the  need  for  designing  experiments  which  may  be  pre¬ 
maturely  halted,  due  for  example  to  a  breakdown  of  the  equipment.  His 
aim  was  to  design  the  experiment  to  study  first  what  are  considered  the 
most  important  factors  so  that  if  the  design  in  not  completed  inferences 
may  be  made  about  the  more  interesting  factors  conditionally  on  the  loss 
important  factors  being  fixed.  1  design  is  specified  as  the  set  of 
treatment  combinations  with  a,  b,  or  k  factors  at  the  high 

level.  Suppose  the  factors  are  arranged  in  increasing  order  of  impor¬ 
tance  and  the  treatment  combinations  are  run  in  increasing  numerical 

formerly  with  Hooketdyns,  now  with  Sylvania  Electric  Products,  Inc., 
Mountain  View,  California 


order  considered  ae  binary  numbers*  Then  the  design  will  have  the 
desired  property.  For  example,  consider  three  faotors  at  two  levels  and 
let  the  design  be  the  treatment  combinations  with  aero,  one,  or  three  of 
the  faotors  at  their  high  level.  If  the  combinations  are  run  in  the 
order  000,  001,  010,  100,  111,  then  after  the  first  two  observations 
the  "main  effect"  (conditional  on  the  other  faotors  being  fixed)  of  the 
last  and  most  important  factor  can  be  estimated.  After  the  third  the 
conditional  main  effects  of  the  latter  two  faotors  can  be  estimated, 
and  finally  after  the  fourth  all  three  main  effects  can  he  estimated. 

I  proposed  some  specific  permutatioxiHinvariant  designs  which  are 
included  in  the  general  class  considered  by  Last  (Webb,  1961).  These 
designs  were  (l)  the  incomplete  2^  consisting  of  one  run  at  each 
treatment  combination  containing  zero,  two,  or  five  faotors  at  the  high 
level;  (2)  the  incomplete  27  consisting  of  one  run  at  each  treatment 
combination  containing  zero,  two,  or  six  factors  at  the  high  level;  and 

ST 

(3)  the  incomplete  2*  consisting  of  one  run  at  each  treatment  combina¬ 
tion  containing  zero  or  two  factors  at  the  high  level  and  two  runs  at 
each  treatment  combination  with  six  faotors  at  the  high  level. 

The  general  class  of  proportional-frequency  designs  has  been  inves¬ 
tigated  by  Addelman  (1962).  Let  n^  be  the  number  of  occurrences  of 
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the  1-th  level  of  one  factor  with  the  j-th  level  of  another,  r4  #  he 

the  number  of  occurrencea  of  the  1-th  level  of  the  first  factor,  n#j 

be  the  number  of  occurrencea  of  the  J-th  level  of  the  second,  and  S 

be  the  number  of  runs*  A  design  is  a  proportional-frequency  design  if, 

for  every  pair  of  factors  and  all  levels  i  and  j,  n^H  equals 

n.  n 
i* 

If  the  grand  mean  is  defined  as  the  expected  value  of  the  average 
of  the  observations  made,  then  proportional-frequency  designs  are 
orthogonal  for  estimating  main  effects  and  the  grand  mean.  If,  however, 
the  grand  seen  is  defined  as  the  average  ct  the  expected  values  of  all 
treatment  combinations  of  the  full  factorial,  then  proportional-frequency 
designs  are  non-orthogonal.  While  neither  the  class  of  proportional- 
frequency  designs  nor  the  class  of  permutation- invariant  designs  in¬ 
cludes  the  other,  the  designs  which  Addelman  catalogues  when  considered 
as  non-orthogonal  designs  are  all  contained  in  both  classes. 

3.  A  CHIRAC TERIZATIOW  OF  SPECIAL 
CLUMPWISE-ORTHOGONAL  DSSIGK3 

Both  permutation-invariant  and  elumpwise-orthogonal  designs  may  be 
characterised  as  integer  solutions  to  linear  programming  problems. 

Methods  for  constructing  the  constraint  matrices  for  the  general  classes 
are  given  in  the  fourth  and  fifth  sections.  In  the  case  of  special 
olumpwlse-orthogonal  designs,  defined  only  for  the  211  experimental 
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situations,  a  characterization  is  possible  which  ie  very  differont  from 
that  for  the  general  class  of  clumpwise-orthogonal  designs.  This  charac¬ 
terization,  involving  groups  of  treatment  combinations,  is  reminiscent 
of  the  classical  theory  of  fractional  factorial  designs. 

Outline  of  the  characterization 

In  the  case  of  2n  experiments,  if  a  resolution  t  +  1  design  is 
orthogonal,  then  for  every  subset  of  t  factors  every  combination  of 
levels  occurs  X  times  (Vebb,  1963)*  As  I  will  show  in  this  section, 
if  a  resolution  t  1  design  is  special  clunpwise-orthogonal,  then 
for  every  subset  of  t  factors  there  exist  constants  X^  and  Xg  such 
that  every  combination  of  levels  occurs  either  ^  or  ^  times,  and 
those  appearing  X^  times  form  a  group.  The  proof  will  proceed  by 
showing  that,  for  each  subset  of  t  factors  of  a  special  clumpwise- 
orthogonal  design,  either  a  group  of  treatment  combinations  or  the 
complement  of  a  group  can  be  added  enough  times  to  make  the  resulting 
augmented  design  orthogonal.  In  order  to  pursue  this  method  of  proof 
three  preliminary  results  must  be  established.  First  we  must  show  that 
if  a  set  of  treatment  combinations  forms  a  group  under  an  appropriately 
defined  operation,  then  it  is  a  clumpwise-orthogonal  design.  Next  we 
will  show  that  any  special  clumpwise-orthogonal  design  in  t  factors 
leads  to  the  same  clump  pattern  as  a  group.  Finally  it  must  be  estab¬ 
lished  that  the  non-zero  off-diagonal  entry  of  a  special  clumpwise- 
orthogonal  design  in  t  factors  is  such  that  the  group  with  the  same 
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clump  pattern  or  its  complement  can  be  added  an  appropriate  integral 
number  of  times  with  the  result  that  the  augmented  design  is  orthogonal. 

Grouna  of  treatment  combinations 

Fisher  (1942)  showed  that  the  set  of  treatment  combinations  in  a 
2s  experiment  form  a  group.  Any  given  treatment  combination  may  be 
designated  as  the  group  identity,  and  as  such  it  is  denoted  by  the 
symbol  (l).  Fisher  makes  no  commitment,  but  later  authors  have  chosen 
to  let  that  treatment  combination  in  which  every  factor  appears  at  its 
low  levs!  be  the  identity.  It  will  be  more  convenient  to  break  with 
tradition  and  designate  as  the  identity  that  treatment  combination  with 
every  factor  at  ita  high  level. 

To  each  factor  let  there  he  assigned  a  letter  a,  b,  and  let 

every  treatment  combination  other  than  the  identity  be  denoted  by  the 
letters  corresponding  to  the  factors  at  their  low  level.  The  group 
operation  ie  defined  as  follows *  the  resultant  of  two  treatment  combi- 
cations  under  the  group  operation  is  that  treatment  combination  which 
contains  all  the  letters  from  the  two  omitting  the  letters  they  have  in 
common. 

Every  treatment  combination  except  the  identity  divides  the  group 
into  two  parte,  those  treatment  combinations  having  an  even  number  of 
letters  in  common  with  the  given  treatment  combination,  and  those  having 
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an  odd  number  in  common.  The  "even  half  contains  the  identity  (l)  and 
is  itself  a  subgroup.  The  set  which  has  an  even  number  of  letters  in 
common  with  each  of  two  given  treatment  combinations  will  again  be  a 
subgroup,  which  will  also  have  an  even  number  of  letters  in  common  with 
both  the  product  of  the  two  given  treatment  combinations  and  the  identity. 
Fisher's  general  result  is  that  the  set  of  elements  even  for  every  ele¬ 
ment  of  a  subgroup  of  order  is  a  subgroup  of  order  2n~ps  where 
n  is  the  total  number  of  factors.  These  two  subgroups  are  said  to  be 
orthogonal. 

The  sot  of  all  possible  parameters  can  also  be  considered  as  form¬ 
ing  a  group.  The  grand  mean,  usually  denoted  1  in  this  context,  is 
the  identity.  The  main  effects  of  the  factors  are  denoted  by  ▲,  B,  C, 
and  so  on.  Interactions  are  denoted  by  the  appropriate  combination  of 
two,  three,  or  more  letters  which  belong  to  the  factors  interacting. 

The  product  of  two  symbols  under  the  group  operation,  called  the 
generalised  interaction  of  the  two  symbols,  is  that  symbol  which  contains 
all  the  letters  of  the  two  omitting  any  they  may  have  in  common.  The 
identity  1  is  considered  as  containing  no  letters.  Thus,  the  general¬ 
ised  interaction  of  main  effects  A  and  B  is  iB,  which  is  a  genuine 
interaction.  The  generalized  interaction  of  interactions  BD  and  BBS 
is  the  main  effect  K.  Hie  group  of  parameters  is  isomorphic  to  the 
group  of  treatment  combinations,  so  that  Fisher's  result  applies  to 
subgroups  of  the  parameter  group  as  well. 


If  a  design  la  constructed  whoa®  treatment  combinations  fora  a  sub¬ 
group  of  order  2P  or  coast  of  a  subgroup  of  order  2*\  the  design  will 
be  a  fractional  factorial,  that  is*  one  block  of  a  confounded  design* 

The  parameters  aliaeed  with  the  grand  mean  X  are  just  those  parameters 
in  the  subgroup  orthogonal  to  the  group  of  treatment  combinations  used* 
The  parameters  aliased  with  any  given  parameter  are  the  elements  of  a 
coset  of  the  orthogonal  subgroup,  where  the  ooset  is  formed  using  the 
given  parameter*  The  entire  group  of  2  parameters  is  divided  into 
2P  alias  seta,  each  containing  2q  *  2n~*>  parameters*  There  is  an 
estimable  linear  combination  of  the  2^  parameters  in  an.  alias  set,  and 
due  to  the  particular  choice  of  the  identity,  this  estimable  linear  com¬ 
bination  is  just  t'  w  sum  of  the  parameters* 


The  cross-product  matrices  of  subgroups  of  treatment  combinations 

Consider  a  coefficient  matrix  whose  rows  correspond  to  a  subgroup  of 
treatment  combinations  and  which  has  columns  corresponding  to  every  pos¬ 
sible  main  effect  and  interaction*  The  cross-product  matrix  formed  from 
this  coefficient  matrix,  which  will  be  ccJLled  the  complete  cross-product 
matrix  associated  with  the  subgroup,  will  of  course  be  singular  unless 
the  subgroup  considered  is  in  fact  the  group  itself* 


THEOREM  1*  The  complete  cross-product  matrix  of  a  subgroup  of 
treatment  combinations  of  order  2^  is  special  clumpwiae* orthogonal, 
and  all  non- zero  entries  are  2^. 
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PROOF,  Finney  (1945)  shoved  that  a  subgroup  defines  an  aliased 
design  whose  alias  structure  is  determined  by  the  alias  subgroup  (this 
fact  is  the  basis  of  the  classical  theory  of  fractional  replication) « 
The  coefficient  vectors  for  aliased  parameters  are  all  equal,  and  co¬ 
efficient  vectors  for  parameters  which  are  not  aliased  are  orthogonal. 
Therefore,  the  complete  cross-product  matrix  consists  of  2n“’p  clumps, 
where  n  is  the  number  of  factors,  and  each  clump  has  all  elements 
equal  to  2P. 

THEOREM  2.  The  complete  cross-product  matrix  associated  with  all 
treatment  combinations  except  those  of  a  subgroup  of  order  is 

special  clumpwise-orthogonal ,  with  diagonal  entries  2n  -  2P  and  non¬ 
zero  off-diagonal  entries  equal  to  -2P. 


form 


PROOF. 

X 

hr 


Represent  the  coefficient  matrix  of  the  full  factorial  in  the 
where  corresponds  to  the  part  of  the  design  whose 


treatment  combinations  form  a  group  and  where  corresponds  to  the 

t  i 

remainder.  The  complete  cross-product  matrix  is  X^X^  +  ^2^2*  ve 

know  already  that  it  is  diagonal  with  diagonal  entries  equal  to  2a. 


The  matrix  is  lmown  to  be  special  clumpwise-orthogonal  from 

i 

Theorem  1,  so  that  X^X^  must  also  be  special  clumpwise-orthogonal  with 
diagonal  entries  2n  -  2P  and  non-zero  off-diagonal  entries  equal  to 
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Let  X  be  the  coefficient  matrix  of  a  special  clumpwiae-crthcgon&l 


design  of  resolution  t  +  1  in  n  factors*  All  the  elements  of  the 

« 

cross-product  matrix  X  X  are  of  course  inner  products  of  pairs  of 
columns  of  X*  Denote  by  the  column  of  X  which  is  the  coefficient 
rector  for  the  grand  mean,  by  r^,  x^,  x^9  •  •*  the  coefficient  rectors 
for  the  main  effects,  by  x^g,  x^,  •«*,  Xg^,  ««•  the  coefficient 
rectors  for  the  two-factor  interactions,  and  so  forth*  Because  of  the 
nature  of  coefficient  rectors  for  the  2n  experimental  situation,  the 
squared  length  or  erery  column  of  X  is  H,  the  number  of  runs* 

Because  of  the  requirement  that  the  design  be  of  the  special  clumpwise- 
orth agonal  form,  the  inner  product  of  a  pair  of  distinct  columns  is 
either  zero  or  a  single  specified  non-zero  number,  say  c*  Since  the 
components  of  coefficient  rectors  for  interactions  are  products  of  com¬ 
ponent*  of  the  coefficient  rectors  for  the  main  effects  of  Idle  factors 


making  up  the  interactions,  many  inner  products  must  be  equal*  For 
example,  the  inner  product  (x^,  x^g)  must  be  equal  to  (x^ 
t©  (*cst  Zjubc]))?  BJU8t  be  equal  to  (xg,  x^)  and  to 

(xgj  3c^g)«  In  fact,  the  inner  products  of  pairs  of  x's  whose  subscripts 
hare  the  same  generalized  interaction  will  be  equal*  In  the  two  examples 
above,  the  common  generalized  interaction  are  AB  in  the  first  case  and 
ABC  in  the  second*  Such  a  generalized  interaction  cannot  involve  more 


,  Xg)  and 


than  t  letters  since  the  oross-product  matrix  comes  from  a  resolution 
t  +  1  design* 
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Since  nil  pairs  of  x's  whose  subscripts  haws  the  sane  generalized 
interaction  have  the  same  inner  product,  we  will  discuss  clumpwise- 
orthogonal  designs  in  terns  of  which  generalized  interactions  give  rise 
to  sets  of  non-zero  inner  products*  If  a  generalized  interaction  gives 
rise  to  non-zero  inner  products,  sons  olunping  is  induced  in  the  cross- 
product  matrix*  To  find  what  effects  or  interactions  are  clumped  with 
a  given  effect  or  interaction, 1  it  is  only  necessary  to  multiply  the 
generalized  interaction  by  the  lettere  corresponding  to  the  given  effect 
or  interaction*  Tor  example,  if  the  generalized  interaction  AB  gives 
rise  to  non-zero  inner  products,  by  multiplying  AB  by  A  we  find  that 
Xg  must  appear  in  the  same  clump  as  x^»  Continuing  with  our  example, 
if  the  cross-product  matrix  is  that  of  a  design  of  resolution  5  or  more, 
then  Xj  and  are  in  the  same  clump*  If  the  cross-product  matrix 

is  that  of  a  design  of  resolution  7  or  more,  then  xQ  and  x^,  x^ 
and  and  so  on  would  be  clumped  together  in  pairs.  The  reason 

for  the  requirement  that  the  resolution  be  large  enough  is  that  only 
interactions  involving  {i  or  fewer  factors  are  represented  in  the 
cross-product  matrix  of  a  resolution  t  +  1  design,  so  that  in  the 
second  oaae  above  x^  would  not  appear  if  the  design  were  not  treated 
as  a  design  of  at  least  resolution  5* 

1  •  Aliasing  is  a  limiting  case  of  clumping  in  which  an  off-diagonal 
element  is  equal  in  absolute  value  to  the  diagonal  elements*  It 
should  be  noted  that  in  the  intermediate  case  when  the  off-diagonal 
element  is  in  absolute  value  less  than  the  diagonal  elements,  the 
corresponding  parameters  are  in  no  sense  "partially  aliased"  or 
"partially  confounded".  Bather,  they  are  separately  or  jointly 
estimable* 
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If  more  than  one  generalized  interaction  gives  rise  to  inner  products 
equal  to  a,  then  their  product  must  also,  if  this  product  contains  t 
or  fever  letters*  To r  example,  if  the  generalized  interaction  A  and  B 
both  give  rise  to  inner  products  equal  to  or*  then  z^,  x^,  and  Xg 
are  in  the  same  clump,  so  that  (x^,  Xg)  must  also  be  equal  to  a,  which 
implies  that  all  inner  products  of  vectors  whose  generalized  interaction 
is  AB  will  also  equal  a*  These  results  ore  proved  generally  in  the 
Appendix  as  Theorem  3,  which  states  that  if  two  generalized  interactions 
give  rise  to  inner  products  equal  to  a,  and  if  their  product  contains 
t  or  fever  letters,  then  their  product  most  also  give  rise  to  inner 
products  equal  to  or*  The  following  theorem  is  an  immediate  corollary 
of  Theorem  3* 

TH8QBBM  4*  Tor  every  subset  of  t  factors  in  a  special  clumpwise- 
orthogonal  design  of  resolution  t  f  1,  the  generalized  interactions 
giving  rise  to  non-zero  inner  products,  together  with  I,  form  a  group* 

It  follows  from  Theorem  4  that  for  any  subset  of  t  factors  the 
coaplete  cross-product  matrix  has  the  same  clump  structure  as  the  com¬ 
plete  cross-product  matrix  of  a  group.  If  the  order  of  the  group  of 
generalized  interactions  mentioned  in  Theorem  4  is  2^,  then  the  com¬ 
plete  cross-product  matrix  consists  of  2?  ■  clumps  each  containing 

2q  parameters*  The  parameters  in  the  clump  with  the  grand  mean  are  just 
those  generalized  interactions  which  give  rise  to  non-zero  inner  products. 
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so  that  tha  parameters  in  this  clump  form  a  subgroup  of  parameters •  The 
parameters  in  the  other  clumps  are  coasts  of  this  subgroup*  We  know 
<0 ready  that  there  is  a  subgroup  of  treatment  combinations  whoso  complete 
cross-product  matrix  has  exactly  the  same  clump  structure  as  the  given 
olumpwiee-orthogonal  design*  To  be  specific,  it  is  that  treatment-com¬ 
bination  subgroup  which  is  orthogonal  to  the  subgroup  of  parameters  in 
the  clump  which  contains  the  grand  mean* 

Value  of  non-zero  inner  products 

The  value  a  of  the  non-zero  inner  products  is  related  to  the  clump 
structure  according  to  the  following  theorem*  The  proof  is  given  in  the 
ippendix. 

THBOBJBM  5.  In  a  special  clumpwise-orthogonal  design  of  resolution 
t  +  1,  if  for  a  aubaet  of  t  factors  the  group  of  generalized  inter¬ 
actions  giving  rise  to  non-zero  inner  products  is  of  order  2**,  then 
o  is  an  integral  multiple  of  2** 

We  are  now  ready  to  characterize  special  clumpwise-orthogonal 
designs  of  resolution  t  +  1  containing  only  t  factors.  The  general 
ease  of  n  factors  will  be  discussed  after  the  theorem* 

THBCKEH  6*  A  resolution  (t  +  l)  design  in  t  factors  with  t 
even  is  special  clumpwise-orthogonal  if  and  only  if  there  exist  constants 
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sad  Xg  such  that  every  treatment  combination  in  a  group  of  treat¬ 
ment  combinations  appears  '  times  and  every  other  treatment  combina¬ 
tion  appears  times. 


PROOF.  The  cross-product  matrix  formed  from  a  partitioned  coefficient 


matrix 

numbers 


I  is  given  by  [x^  +  X^X2J*  That  the  stated  condition  on 
of "treatment  combinations  implies  clumpviae-orihogonality  is 


obvious  from  Theorems  1  2.  Suppose  a  design  containing  V  runs  is 

cluapvise-orthogonal,  that  the  group  of  generalized  interactions  giving 
rise  to  non-zero  inner  products  is  of  order  2^v  and  that  the  number  of 
clumps  is  2P  *  2t“q.  Denote  the  coefficient  matrix  of  the  given  special 
cluapvise-orthogonal  design  by  X^,  and  denote  by  the  coefficient  matrix 
cf  that  group  of  treatment  combinations  which  has  the  same  clump  structure 
as  the  given  design.  The  group  must  be  of  order  2**.  Let  a  m  c  •  2^, 
where  c  is,  by  Theorem  5,  a  non-zero  integer. 

CASE  I.  a  <  0.  Consider  a  new  design  consisting  of  all  treatment 
combinations  of  the  original  design  augmented  with  the  group  of  treatment 
combinations  replicated  -c  times.  The  cross-product  matrix  of  the 

d“lgn  18  hh  -  0  *  Vz*  •q“a'  -  °  •  **> 

the  identity  matrix  of  appropriate  size.  The  augmented  design  is 
orthogonal,  hence  has  every  treatment  combination  replicated  the  same 
number  of  times,  namely  (l  -  c  •  2?)/2*,  Therefore  the  original  design 
must  have  been  of  the  specif  ied  form  with  X^  equal  to  &  +  o. 


m 


mad  X,  equal  to  S  ~  ft.  *■  **■■*  The  total  number  of  treatment  combina- 

2  oP 

tions  in  the  original  design  is  +  o)  +  (2*  - 2P) (£-"■*/  ?-) , 

2*  2* 

whioh,  as  required*  equals  B* 

CASK  II*  or  >  0«  If  the  original  design  is  augmented  by  adding  the 
complement  of  the  gro^p  o  times ,  then  the  resulting  design  is  an 
orthogonal  design  with  every  treatment  combination  replicated 

. ft"  ~mjR)  times*  Hence  the  original  design  was  of  the  specified 

for.  witt  X,  .  ^  ^  .  »■*  A  (2*  -  2P)  .  e. 

ttctensloa  to  n  variables 

According  to  Theorem  6 ,  a  necessary  condition  -Quit  a  resolution 
t  +  1  design  in  n  variables  be  special  clumpwlsc  -orthogonal  is  that 
every  sub-design  containing  only  t  of  the  factors  have  the  structure 
specified  in  Theorem  6*  It  ie  necessary  to  impose  some  additional  con¬ 
dition  to  make  the  converse  true* 

As  an  example  of  a  design  for  which  the  converse  does  not  apply, 
consider  the  following  coefficient  matrix  and  corresponding  cross-product 
matrix* 
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1  -1  -1  -il 
1  -1  -1-1 
1-1-1  1 
-  1-111 

1  "  1  1-1-1 

111-1 
1111 
1111 

Denote  the  four  parameters  by  X»  A*  B,  and  C  as  before*  By  consider¬ 
ing  the  subset  consisting  of  factors  one  and  two,  the  parameters  A  and 
B  are  clumped.  Similarly,  consideration  of  the  second  and  third  factors 
shows  that  B  and  C  are  clomped  •  The  sub-design  consisting  of  the 
first  and  third  factors  E  however,  is  fully  orthogonal,  so  that  A  and  C 
do  not  appear  in  the  sane  clusp.  In  this  ease  the  oluap  structures  of  the 
three  possible  sub-designs  are  not  mutually  consistent. 

The  general  characterisation  now  follows  from  Theorem  6. 

THEOREM  7.  Consider  a  resolution  t  +  1  design  in  n  factors  with 
t  even.  If 

a)  for  every  subset  of  t  factors  there  exist  constants  and 
*2  satisfying  the  condition  of  Theorem  6, 

b)  the  clusp  structures  of  the  sub-designs  are  mutually  consistent, 
and 

c)  the  structures  of  the  sub-designs  yield  the  same  non-sero  off- 
diagonal  element, 

then  the  design  is  special  clumpviee-orthogon&l. 


(XX) 


8  0  0  0 
0  8  4  0 
0  4  8  4 
0  0  4  8 
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4.  THE  GENERAL  CLASS  Of 
CLUMPWISE-OSTHOGOHAL  LESIONS 

While  It  is  not  as  simple  or  Attractive  a  characterisation  as  that 
just  discussed,  general  clunpwise-orthogonal  designs  say  be  characterised 
as  integer  solutions  to  sets  of  linear  constraints.  Given  a  set  of  con¬ 
straints  for  a  fully  orthogonal  design,  removal  of  one  or  more  constraints 
yields  a  set  of  constraints  for  olumpwise-orthogonal  designs.  Alterna¬ 
tively,  given  a  clump  structure,  a  set  of  constraints  may  be  written 
down  expressing  the  orthogonality  remaining  in  the  design. 

Deletion  of  constraints 

For  discussion  of  the  way  in  which  the  removal  of  constraints  in¬ 
duces  clumps  in  the  cross-product  matrix,  it  will  be  helpful  to  use  a 
2V  design  as  an  illustrative  example.  As  usual,  let  the  levels  of 
the  two-level  factors  be  designated  by  0  and  1  and  the  levels  of  the 
three-level  factors  by  0,  1,  and  2.  Treatment  combinations  will  be 
written  with  the  two-level  factors  appearing  first.  The  symbol  [ijki] 
will  be  used  to  designate  the  number  of  times  treatment  combination 
ijkA  appears  in  a  design;  [ijkx]  will  denote  the  sum  [ijkO]  +  [ijkl]  ♦ 
[ijk2],  and  similarly  for  an  x  substituted  into  other  positions. 

It  has  been  previously  shown  (Vebb,  1963),  that  a  design  is  an 
orthogonal  design  of  resolution  t  1  (or  strength  t)  if  and  only  if 
for  every  subset  of  t  factors,  every  combination  of  levels  appears  the 


same  number  of  tines.  ▲  linearly  independent  set  of  constraints  for  a 
strength  2  design,  obtained  by  the  "base-set  method"  (Vsbb,  1963),  is 
given  in  Table  I. 

Ve  wish  to  find  out  shat  clumping  is  induced  in  the  cross-product 
matrix  if  the  first  constraint  is  deleted.  Since  the  constraints  in  the 
table  are  linearly  independent,  none  of  the  regaining  constraints  is 
affected.  It  can  be  verified  that  for  all  permissible  ij  combinations, 
the  following  constraints  still  hold:  [ixjx]  -  6X,  [ixxj]  »  6X, 

[xijx]  »  6X.  [xixj]  ■  6X,  and  [xxij]  »  4X.  Therefore,  for  the  subset 
consisting  of  the  first  two  factors  the  condition  for  orthogonality  no 
longer  holds,  but  for  any  other  subset  it  is  unaffected.  It  follows  by 
the  theorem  on  orthogonality  previously  stated  that  the  design  is 
orthogonal  with  the  exception  that  the  main  effects  of  the  two-level 
factors  are  clumped.  The  effect  on  the  clump  structure  of  removal  of 
other  constraints  or  sets  of  constraints  can  be  found  in  an  analogous 
manner. 
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Sable  1 


[00 xx]  »  9X 
[OzOz]  «  6X 
[Oxlx]  «  6X 
[OrsO]  -  6X 
[tel]  -  6X 
[0zx2]  *  6X 
[xOOx]  6X 
[xOlx]  -  6X 
[xOxOj  -  6X 
[xOxl]  «  6X 


[xOx2]  m  6X 
[zxOO]  «  4X 
[xxOl]  *  4X 
[aosC2]  *  4X 
[xxio]  -  4X 
[xzXl]  *  4X 
[xx!2]  «  4X 
[xz20]  -  4X 
[zz2l]  ■»  4X 
[xz22]  «  4X 


Constraints  for  orthogonal  2252  design 


The  practice  of  removing  one  or  more  specific  constraints  from  those 

for  a  fully  orthogonal  design  cannot  yield  constraints  for  all  clumpwise- 

orthogonal  designs ,  for  it  may  be  that  constraints  for  a  particular  clump 

structure  can  only  be  obtained  by  removing  a  particular  linear  combination 

2  2 

of  constraints*  For  example,  for  the  2  3  experiment  considered  pre¬ 
viously,  constraints  for  the  design  with  only  the  main  effect  of  the 
first  factor  clumped  with  the  mean  cannot  be  constructed  in  the  former 
manner.  The  alternative  approach  is  to  start  with  &  clump  structure  and 
write  down  constraints  reflecting  the  orthogonality  remaining  in  the 
design. 

To  illustrate  the  procedure,  the  constraints  for  a  particular  clump 

structure  for  2232  designs  will  be  derived.  Denote  the  grand  mean  by 

X,  the  effects  of  the  two-level  factors  by  ▲  and  B,  and  the  linear 

2  2 

and  quadratic  effects  of  the  three-level  factors  by  C,  C  ,  D,  and  D  • 
The  clump  structure  to  be  considered  is  described  by  the  following  dia¬ 
grammatic  representation  of  the  cross-product  matrix; 


107 


Here  the  name  of  a  parameter  (I,  A,  B,  etc*)  is  used  to  denote  the 
squared  length  of  the  associated  coefficient  vector  in  the  coefficient  • 
matrix.  The  symbol  I  denotes  a  non-zero  off-diagonal  element. 

For  this  clump  structure,  any  subdesign  consisting  of  a  tiro— level 

and  a  three-level  factor  is  completely  orthogonal.  The  corresponding 

constraints  are  those  requiring  that,  for  every  subset  consisting  of  one 

factor  at  two  and  one  at  three  levels,  every  possible  combination  of 

levels  occurs  the  same  number  of  times  X.  The  corresponding  constraints, 

reduced  to  a  linearly  independent  set,  are  the  first  fifteen  constraints 

2 

given  In  Table  IX.  In  addition,  C  is  required  to  be  orthogonal  to  D 
2 

and.  If  to  C  .  The  components  of  the  coefficient  vector  for  C  are 

-1,  Oj  and  1  when  C  is  at  levels  0,  1,  and  2  and  similarly  the  com- 

2 

ponenia  of  the  coefficient  vector  for  D  are  1,  -2,  and  1  when  D  is 
at  levels  0,  1,  and  2.  Therefore,  the  inner  product  of  the  coefficient 
vectors  for  C  and  D2  is  -[xxOO]  +2[rr0l]  -  [xx02]  +  [xx20]  -2[xx2l]  4* 
[xx22]«  The  final  two  conetrainte  in  Table  II  are  obtained  by  setting 

equal  to  zero  the  inner  product  of  the  coefficient  vectors  for  C  and 

2 

1)  and  likewise  the  inner  product  of  the  coefficient  vectors  for  D 
and  C2. 

Other  clump  structures  for  other  designs  can  be  handled  in  an  ana¬ 
logous  manner.  This  concludes  the  discussion  of  clumpwise-orthogonal 


Table  II 

[OzOx]  *  X 

[xOOx]  =  X 

[irOx]  *  X 

[Oxlx]  -  X 

[xOlx]  «  X 

[ixlx]  *  X 

[0x2x]  -  X 

[x02x]  -  X 

[lx2x]  *  X 

[OxxO]  *  X 

[xOxO]  *  X 

[lxxO]  «*  X 

[Oxxl]  -  X 

[xOxl]  *  X 

[lxxl]  -  X 

[xxOO]  +2[xx0l] 

-  [xx02]  ♦  [xx2l] 

-2[xx2l]  +  [xx22]  «  0 

[xxOO]  +  [xx02]  +2[xxl0]  ~2[xxl2] 

-  [xx20]  +  [xx22]  «  0 

2  2 

Constraints  for  2  3  clumpwiao-orthogonal  design* 
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5.  THE  CLASS  OF  PESMOTATIOU- 
IHVARIAMT  BESIISJS 


The  construction  of  constraints  for  elumpwiee-orthogonal  designs 
involves  weakening  the  constraints  for  orthogonal  designs  by  deleting 
linear  combinations  of  constraints «  The  constraints  for  permutation- 
invariant  designs ,  on  the  other  hand,  are  obtained  by  introducing  more 
indices*  Before  shoving  hov  this  is  accomplished,  some  additional  general 
material  on  this  class  of  designs  will  be  presented* 

Permutation-invariant  designs  of  strength  t 

The  appealing  property  of  permutation-invariant  designs  is  that  factors 
with  the  same  number  of  levels  are  treated  alike  in  the  sense  that  the 
estimates  of  corresponding  main  effects  have  the  same  variances  and  the 
same  covariances  vith  other  estimates*  For  this  to  hold  in  a  design  of 
resolution  t  +  1,  it  is  not  necessary  that  the  design  be  permutation- 
invariant  as  a  whole,  but  only  that  all  eubdesigns  containing  t  factors 
be  permutation-invariant  (this  follows  from  the  faot  that  the  cross- 
product  matrix  contains  inner  products  of  interactions  involving  up  to 
•$- 1  factors,  so  that  the  largest  number  of  factors  involved  in  computing 
any  element  of  this  matrix  is  t)*  Therefore,  it  is  convenient  to  define 

t 

formally  a  more  general  class  of  designs,  permutation-invariant  designs 
of  strength  t* 
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Given  n  factors,  select  any  ordered  eubset  of  t  factors  and  let 
f(i^s  ±2s  ...,  i^)  be  the  number  of  oocurrences  of  level  of  the 
first  factor  of  the  subset  with  level  ±2  of  the  second,  ...,  with  level 
of  the  *r-th.  Suppose  t  is  the  largest  value  of  <r  such  that  the 
function  f(i^,  ±2,  i^)  depends  on  the  number  of  levels  of  the 

factors  chosen  to  be  first  through  r-th,  but  not  on  the  particular 
choice  of  ordered  subset.  Then  the  design  is  a  permutation-invariant 
design  of  strength  t.  It  should  be  noted  that  the  permutation-invariant 
designs  mentioned  in  the  literature  survey  are  all  of Uhll  strength;  that 
is,  they  are  of  strength  n  where  n  is  the  number  of  factors. 

Several  permutation-invariant  designs  can  be  shown  to  satisfy  one 

or  another  of  the  criteria  for  optimality  given  in  the  introduction.  It 

4 

can  be  shown  by  an  enumeration  of  all  possible  five-run  incomplete  2 
designs  that  the  design  given  by  Chakravarti  (1956)  consisting  of  runs 
at  treatment  combinations  0000,  0111,  1011,  1101,  and  1110  is  the  essen¬ 
tially  unique  resolution  3  design  which  satisfies  criterion  (i)  (the 
variances  are  each  minimized).  By  "eHsentiaily  unique11  I  mean  that  the 
only  other  designs  with  this  property  are  those  derived  from  the  given 
design  by  interchanging  for  some  of  the  factors  the  designation  of  which 
is  the  high  end  which  is  ths  low  level.  Th©  complement  of  this  design, 
that  is,  the  design  consisting  of  runs  at  the  other  eleven  treatment  com¬ 
binations,  is  the  essentially  unique  eleven-run  resolution  5  design  which 
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satisfies  criteria  (ii),  (iii),  and  (iv)  (the  trace,  determinant,  and 
average  predicted  value  are  minimised) ,  The  twelve-run  design  obtained 
by  adding  a  run  at  the  treatment  combination  0000  is  still  permutation- 
invariant  and  also  satisfies  criteria  (ii),  (iii),  and  (iv)  among  twelve- 
run  designs  (Vebb,  1961) •  It  is  not  unique  in  this  respect,  however, 
there  being  three  other  twelve-run  designs,  not  essentially  equivalent, 
which  also  satisfy  these  three  criteria,  Two  of  the  other  designs,  both 
irregular  fractional  factorials  and  hence  special  clumpwise-orthogonal, 
are  that  obtained  by  omitting  0000,  0011,  1110,  and  1101,  and  that 
obtained  by  omitting  0000,  0110,  0101,  and  0011,  The  third  design, 
neither  clumpwise-orthogonal  nor  permutation-invariant,  omits  0000,  1110, 
1101,  and  1011. 

Linear  constraints  for  permutation-invariant  designs 

For  orthogonal  designs  the  total  number  of  run*  in  certain  subsets 

of  treatment  combinations  is  equal  to  a  known  multiple  of  the  single 

index  X.  For  permutation-invariant  designs,  the  total  number  of  runs 

at  treatment  combinations  in  thess  subsets  is  equal  to  a  known  multiple 

3  2 

of  one  of  several  indices.  To  illustrate,  constraints  for  a  2  3 
permutation-invariant  design  of  strength  2  will  be  derived, 

According  to  the  definition,  the  number  of  times  s  pair  of  tvo-level 
factors  appear  at  levels  0  and  0  is  independent  of  the  choice  of  the 
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pair.  Denoting  this  number  by  cr^»  we  obtain  the  constraints  [OOxxx]  «  a^» 
[OxOxx]  *  ay ,  and  [rOOrx]  *  .  Similarly,  the  number  of  times  a  pair 

of  two-level  factors  appear  at  levels  0  and  1  (or  equivalently  at  1  and 
0)  is  a  constant  which  may  differ  from  c^.  The  number  of  times  a 
pair  of  two-level  factors  appear  at  levels  1  and  1  is  o^*  By  considering 
the  three-level  factors  six  more  indices  may  be  defined.  Let  ...»  gjg 
be  the  number  of  occurrences  of  the  combinations  00,  01,  02,  11,  12,  and 
22  respectively.  Finally,  consideration  of  one  factor  at  tvo  levels  and 
one  at  three  levels  produces  the  indices  Y^,  ...,  Y^  corresponding  to 
the  combinations  00,  01,  02,  10,  11,  and  12  respectively,  A  set  of  linear 
constraints  may  now  be  written  down.  As  usual,  the  complete  set  is 
linearly  dependent,  and  in  Table  III  a  linearly  independent  subset  has 
been  extracted. 

As  was  noted  previously,  an  orthogonal  design  is  a  special  case  of 
a  permutation-invariant  design.  More  specifically,  an  orthogonal  design 
of  strength  t  is  a  permutation-invariant  design  of  strength  t  in  which 
all  the  indices  are  the  appropriate  multiples  of  the  single  index  X* 

It  may  be  verified  that  if  in  Table  III  ct^  is  replaced  by  9X,  each 
0  by  4X,  and  each  Y  by  6X,  the  constraints  are  those  of  a  strength 
2  orthogonal  design. 


115 


Table  III 


[OOxxx]  - 

[xOxlx]  m 

[xxxOO]  *  ^ 

[OxOzx]  - 

[xOxxO]  m 

[xxxOl]  *  ^ 

[xOOzx]  - 

[xOsxl]  «  Y2 

[xxx02]  »  ^ 

[OxxOx]  m 

[rOrx2]  -  ^ 

[xxxlO]  - 

[Oxzlx]  « 

[rzOOx]  -  ^ 

[yaorll]  «  ^ 

[OxxxO]  -  ^ 

[xxOlx]  ■  X, 

[xxxl2]  »  ^ 

[Oxxxl]  -  ^2 

[xxOrO]  -  ^ 

[xxx20]  -  ^ 

{OzxxZ}  «  ^ 

[ zxOxl]  »  ^ 

[xxx2l]  -  % 

[xOxOx]  .  ^ 

[xx0x2]  » 

[xxx22]  «  36 

Constraints  fcr 

3  2 

2  ?  parmitatioa-inrarlaat  design. 

APPENDIX 


fhis  Appendix  contains  proofs  of  two  Theorems  used  in  Section  3* 

Recall  that  the  column  of  the  coefficient  matrix  X  which  is  the 
coefficient  vector  for  the  p&raaaeter  A  ia  denoted  by  *y  Here  J  may 
be  I  (the  grand  naan),  A»  B,  •  ••»  AB,  AC,  •  etc*  Two  pairs  of 
columns  have  the  same  inner  product  if  their  subscript a  have  the  same 
generalized  interaction*  Special  dumpwiee-orthogonal  designs  can  there¬ 
fore  be  characterized  in  terms  of  which  generalized  interactions  give 
rise  to  inner  products  equal  to  the  nonzero  off-diagonal  element  or* 

For  a  design  of  resolution  t  ♦  1  only  parameters  involving  -J-t  or 
fewer  factors  are  involved  in  the  cross-product  matrix*  The  generalized 
interactions  giving  rise  to  non-zero  inner  products  may  contain  up  to  t 
letters*  If  such  a  generalized  interaction  is  multiplied  by  a  given 
effect  or  interaction,  and  if  the  resultant  contains  1  or  fever  letters, 
then  the  resultant  and  the  given  parameter  appear  in  the  asms  dump* 

THEOREM  3«  If  two  generalized  Interactions  give  rise  to  inner  prod¬ 
ucts  equal  to  ©,  and  if  their  product  contains  t  or  fever  letters, 
then  in  order  for  the  design  to  be  special  clumpwise-orthogoaal  their 
product  must  also  give  rise  to  inner  produets  equal  to  cr« 

PROOF*  Let  o  be  the  number  of  letters  common  to  both  generalised 
interactions,  d  be  the  number  of  unique  letters  in  one,  and  e  the 
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number  of  unique  letters  in  the  other.  Since  generalized  interaction* 
giving  rice  to  non-zero  inner  products  oust  contain  t  or  fewer  letters  9 
we  hare  c  +  d  A  t  and  c  +  e  a  t.  Also  d  +  e  At,  since  by  hypothesis 
the  product  of  the  two  generalized  interactions  contains  no  wore  than  t 
letters.  Without  losing  generality  it  nay  be  assuned  that  d  i}t, 

She  theorem  can  now  be  proved  by  exhibiting  for  each  of  several  cases  a 
pair  of  x's  which  must  be  in  the  sane  clump,  and  the  generalized  inter¬ 
action  of  whose  subscripts  is  the  product  of  the  two  original  generalized 
interactions.  Let  v  *  i  t,  so  that  only  x's  with  v  or  fewer  letters 
in  their  subscripts  can  be  considered.  Let  the  two  generalized  inter¬ 
actions  be  denoted  by  (i^,  ...»  1Q,  ...»  jd)  and 

(i^f  »M|  igf  «•«• 

CASK  1.  c  >  v,  d  and  e  unrestricted.  By  multiplying  the  two 
generalized  interactions  by  (i^,  ...»  1^)  it  is  found  that  the  three 
x's  with  subscripts  (i^»  •••*  •••»  *c»  •••»  ^d)» 

and  (i^^,  ••••  1Q,  ...,  k#)  are  all  in  the  sane  clump.  Hence  the 

inner  product  of  the  last  two  must  be  equal  to  a,  so  that  all  pairs  of 
x's  whose  subscripts  have  the  generalised  Interaction 
(j^»  •••»  kj*  •••#  k#)  must  have  inner  product 

CASK  II.  c  A  v,  e  a  v*  d  unrestricted.  By  multiplying  the  two 
generalized  interactions  by  (i^„  ...»  iQ)  it  is  found  that  the  three 
x's  with  subscripts  (i^  ...,  ic),  (Jlf  3d),  and  (l^,  k#) 
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must  all  be  in  the  sane  clump.  Therefore  all  pairs  of  x*s  whose  sub¬ 
scripts  hare  generalised  interaction  (j,t  k.,  •  *.*  k#)  must 

have  their  inner  product  equal  to  o.  If  c  its  sere,  (ig,  **.*  i^) 
is  taken  to  be  the  single  subscript  I. 

CiSE  Ill*  c  i  i,  e  >  v,  d  unrestricted*  By  suit  inlying  the  two 
generalised  interactions  by  (xgt  i^,  kg*  ...»  k^^)f  where  g  is 
the  larger  of  e  and  df  it  is  found  that  the  x*s  with  subscripts 

•••'  *o*  *1 . K-gt’  *41*  •***  J4*  kl . k*-p*  •Bd 

(k  . ,  *  k  )  are  in  the  same  clump,  Bote  that  the  number  of  letters 
»”gri  e 

in  these  three  generalised  interactions  are  v  -  g  ♦  c*  t  -  g  +  d,  and 
*-»*g  and  these  three  numbers  are  all  less  than  or  equal  to  v,  so 
that  the  three  x*s  named  all  appear  in  the  X  matrix* 

TEQBG3KH  5.  In  a  special  clumpwise-orihegonal  design  of  resolution 
t  *  1*  if  for  a  subset  of  t  factors  the  group  of  generalised  inter¬ 
actions  giving  rise  to  non-zero  inner  products  is  of  order  2^,  then  a 
is  an  integral  multiple  of  2*,  where  p  *  t  -  q. 

PROOF.  Given  a  subset  of  t  factors*  suppose  it  is  possible  to 
pick  two  z*s  from  the  complete  design  matrix  which  are  net  in  the 
sane  clump  with  each  other  or  with  Xj*  Then  the  x  whose  subscript  is 
the  generalised  interaction  of  the  first  two  x8»  must  be  in  a  fourth 
clump*  for  if  not*  the  first  two  x's  would  either  be  in  the  sans  clump 
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or  one  would  bo  in  the  clump  containing  Xg*  In  general*  then ,  it  is 
possible  to  pick  a  set  ox  2^  x'b,  one  from  each  clump,  whose  subscripts 
form  a  group.  But  this  sot  of  2^  x8e  itself  forms  a  design  matrix  of  a 
resolution  2p  +  1  design  in  p  factors  which  can  be  taken  to  be  any 
set  of  p  generators  of  the  group.  Because  each  x  comes  from  a  (sepa¬ 
rate  clump,  it  is  an  orthogonal  design.  By  a  result  of  Webb  (l%5),  this 
design  must  consist  of  the  full  factorial  replicated  some  number,  k,  of 
times*  Therefore  the  coefficient  matrix  X  of  this  smbdesign  consists 
of  2P  sets  of  k  equal  row  vectors. 

Wow  consider  any  vector  Xg  which  is  in  the  same  clump  with  Xg. 
Since  Xg  is  orthogonal  to  each  column  (except  Xg)  of  the  coefficient 
matrix  for  the  orthogonal  subdesign,  the  projection  of  Xg  on  the  sub- 
space  spanned  by  the  columns  of  the  orthogonal  subdesign  must  be  a  con¬ 
stant  times  Xg*  This  subspaot  consists  of  all  vectors  which  are  constant 
over  each  set  of  k  consecutive  components,  so  that  the  projection  of 
onto  this  subspace  replaces  each  component  in  a  set  by  the  average  of  the 
k  components  in  the  set*  Since  the  projection  is  a  multiple  of  Xg, 
the  sum  of  sets  of  k  consecutive  components  of  Xg  must  be  a  constant 
c,  and  since  Xg  consists  of  plus  and  minus  ones,  o  must  bs  integral. 
But  a  is  equal  to  the  inner  product  of  Xg  and  Xg,  which  is  the  sum 
of  all  the  components  of  Xg,  which  must  be  e  times  2^. 
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APPENDIX  C 


OPTIMALITY  PROPERTIES  OF  ORTHOGONAL  DESIGNS 

SUMMARY 

It  is  well  known  that,  subject  to  the  restriction  that  the  squared 
lengths  of  the  column  vectors  of  the  design  matrix  be  fixed,  the  variances 
of  the  individual  parameter  estimates  are  each  minimized  if  and  only  if  an 
incomplete  factorial  design  is  orthogonal.  Except  in  the  case  of  2n 
experiments  the  restriction  will  often  be  artificial,  and  without  the 
restriction  it  is  shown  that  this  particular  optimality  property  no  longer 
applies. 

Two  criteria  for  optimality  will  he  considered?  a)  minimization  of 
the  volume,  adjusted  for  the  number  of  runs,  of  a  confidence  ellipsoid  on 
the  estimated  parameters,  and  b)  minimization  of  the  average  variance  of 
a  predicted  value  adjusted  for  the  number  of  runs,  where  the  average  is 
taken  over  all  points  of  the  full  factorial.  Considering  parameteriza- 
tions  for  which  the  full  factorial  is  orthogonal  and  which  involve  main 
effects  only,  it  is  shown  that  criteria  (a)  and  (b)  are  both  satisfied 
if  and  only  if  the  design  is  orthogonal. 

These  criteria  are  also  satisfied  for  many  more  complicated  parameter- 
izations,  including  main  effects  plus  all  two-factor  interactions,  main 
effects  plus  all  two-  and  three-factor  interactions,  etc.  They  also  are 
satisfied  for  any  nonsingular  transformation  of  any  of  these  parameteriza- 
tions. 
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1.  INTRODUCTION 


This  paper  is  concerned  with  factorial  designs,  in  which  a  number  of 
factors  are  each  restricted  a  priori  to  a  specified  number  of  levels,  in 
contrast  to  regression  designs,  in  which  the  factors  are  continuously  vari¬ 
able.  Whether  or  not  a  factorial  design  is  orthogonal  depends  on  the 
parameterization  used  to  describe  the  response.  The  expected  values  of 
the  response  of  interest  will  vary  between  the  points  or  treatment  combina¬ 
tions  of  the  full  factorial,  and  the  parameters  are  defined  in  terms  of  these 
expected  responses.  A  canonical  parameterization  will  be  specified  in  the 
following  manner t  The  grand  mean  p,  is  the  average  expected  response, 
averaged  over  every  point  of  the  full  factorial.  The  main  effects  for  a 
factor  at  say  s  levels  are  defined  in  terms  of  the  quantities 
jli,  i  =  0,  1,  ...»  s  -  1,  which  are  the  expected  responses  averaged  over 
all  treatment  combinations  in  which  the  given  factor  is  at  level  i.  More 
specifically,  there  are  s  -  1  main-effect  parameters  each  defined  as  a 
linear  combination  of  the  p,^,  and  such  that  the  linear  combinations  are 
orthogonal  and  the  sums  of  their  coefficients  are  zero.  It  will  be  con¬ 
venient  for  later  use  to  let  the  linear  combinations  be  scaled  so  that  each 
has  squared  length  l/s.  Interaction  parameters  involving  two  or  more  factors 
are  similarly  defined  in  terms  of  average  expected  values  with  the  factors 
involved  at  fixed  levels. 

Once  a  set  of  parameters  is  defined  in  terms  of  the  expected  responses 
at  the  various  treatment  combinations,  the  process  can  be  reversed,  and  the 
expected  responses  may  be  expressed  approximately  as  linear  combinations 
of  the  parameters.  If  there  are  as  many  parameters  as  points  in  the  full 
factorial  (that  is,  if  interactions  of  all  orders  are  defined), 
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then  the  expected  responses  can  be  expressed  exactly.  If  there  are  fewer 
parameters,  then  some  degree  of  approximation  is  necessary.  In  practice, 
however,  the  approximation  is  often  good,  relative  to  experimental  error, 
even  when  the  number  of  parameters  is  much  smaller  than  the  number  of  points. 

Given  N  observations  at  any  set  of  treatment  combinations,  the  values 
of  the  responses  can  be  expressed  in  the  form  Y  =  Xp  +  e.  Here  Y  is  th& 

H  x  1  vector  of  responses,  0  is  the  p  x  1  vector  of  parameters,  X  is 
an  N  x  p  matrix  called  the  coefficient  matrix,  and  e  is  an  N  x  1  vector 
of  errors.  If  the  approximation  of  expected  values  by  linear  combinations  of 
parameters  is  adequate,  then  the  components  of  e  are  assumed  to  be  indepen¬ 
dent  and  identically  distributed  experimental  errors.  If  the  matrix  X'X  is 
nonsingular,  then  the  least-squares  estimate  of  the  vector  g  is  g  =  (X'X^^X'Y. 
The  covariance  matrix  of  g  is  (X'X)  a  ,  where  a  is  the  common  unknown 
variance  of  the  components  of  e. 

2.  CRITERIA  FOR  OPTIMALITY 

Consider  a  class  of  designs  for  which  the  lengths  of  the  column  vectors 
of  X  are  fixed.  If  there  is  an  orthogonal  design  (i.e.  X'X  is  diagonal) 
in  this  class,  then  the  variances  of  all  the  estimates  will  be  smallest  using 
this  design.  This  is  a  well  known  result  which  was  proved  by  Plackett  and 
Burman[l].  The  restriction  that  the  lengths  of  the  columns  be  fixed  may 
appear  to  be  simply  a  natural  way  of  fixing  the  scale  of  measurement,  but  for 
factorial  designs  it  is  a  very  artificial  restriction.  If,  as  is  more  natural, 
the  number  of  runs  and  the  parameterization  are  fixed,  then  this  optimality 
property  of  orthogonal  designs  no  longer  holds,  as  the  example  in  the  follow¬ 
ing  paragraph  indicates. 
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An  experimental  situation  will  be  considered  in  which  there  is  a  single 
factor  at  three  levels.  This  example  is  chosen  for  simplicity,  but  it  is 
nontrivial  in  the  sense  that  the  conclusions  which  can  bo  drawn  from  it 
apply  equally  well  to  more  comprehensive  experimental  situations,  The  param¬ 
eters,  defined  in  terms  of  the  expected  responses  y,Q,  and  y,  ,  are  the 
grand  mean  =  (|*  +  ^  +  n2)/3,  the  "linear  effect"  =  (y2  -  p.Q)//6, 
and  the  "quadratic  effect"  ^  -  2^  -f  ^ The  design  containing 

two  runs  at  each  treatment  combination  is  orthogonal  and  the  variances  are 
2 

c  /6  for  each  of  the  three  estimates.  The  design  containing  two  runs  at 

P  /s 

level  0,  one  run  at  1,  and  three  runs  at  2  has  variances  11c  /54  for 
5o^/36  for  and  29o2/l08  for  $2*  Thus  the  variance  for  ^  is  smaller 
than  for  the  orthogonal  design  with  the  same  number  of  runs.  The  design  with 
one  run  at  level  0,  three  runs  at  1,  and  two  runs  at  2  yields  variances  of 
lla2/54  for  $Q,  o2/4  for  and  17c2/l08  for  $2.  If  one  uses  this 

design,  the  variance  of  ^  is  smaller  than  for  the  orthogonal  design. 

It  is  apparent  from  this  example  that  the  given  criterion  for  optimality 
is  too  strongs  that  is,  there  will  in  general  be  no  design  for  which  the  vari¬ 
ance  of  each  estimate  will  be  minimized.  It  therefore  seems  appropriate  to 
search  for  other  criteria  for  optimality. 

The  purpose  of  a  factorial  experiment  is  usually  either  to  estimate  param- 
eters  or  to  describe  the  response  over  the  grid  of  possible  treatment  combina¬ 
tions.  Consequently,  it  seems  natural  to  consider  two  criteria  corresponding 
to  these  two  situations. 

Consider  designs  of  N  runs  with  p  factors.  The  first  criterion  is 
based  on  the  product  of  the  determinant  of  the  matrix  (X  x)~*  and  If 
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an  N-run  design  is  replicated  r  times,  so  that  the  oomplete  design  contains 
Nr  runs,  each  element  of  the  covariance  matrix  a  (X'X)  of  the  original 
design  is  divided  by  r  and  the  determinant  of  (X'X)  is  divided  by  r  . 

The  purpose  of  multiplying  the  determinant  by  E*5  is  therefore  to  adjust  for 
the  number  of  runs.  The  reason  for  using  the  determinant  in  the  first  place 
is  as  follows.  A  confidence  set  for  the  parameters  is  given  by  the  set  of 
points  for  which  ( 0-|)  *  (X'X)  (  £-$)  £  ps^,  where  s^  is  the  residual  sum 
of  squares  and  where  K  is  a  constant  depending  on  the  confidence  level 
(l-o)»  p»  N,  and  the  distribution  of  the  errors.  If  the  errors  are  normally 
distributed,  K  is  the  upper  rv  point  of  the  F  distribution  with  p  and 
N-p  degrees  of  freedom.  The  volume  of  the  confidence  ellipsoid  so  determined 
is  given  by 

-  2>  (pAfo  t 

pr(*p)  Ajdet(X*X) 

Therefore,  the  volume  of  this  ellipsoid  is  a  monotonically  increasing  function 
hf  the  determinant  of  (X'X)”^. 

The  second  criterion  is  appropriate  when  the  purpose  of  the  experiment 
is  to  predict  the  response  equally  well  at  each  treatment  combination  of  the 
full  factorial.  Such  predictions  can  be  made  using  the  parameter  estimates, 
and  the  variances  of  the  predictions  depend  on  the  variances  and  covariances 
of  these  estimates.  The  criterion  proposed  is  the  average  variance  of  a  pre¬ 
dicted  value,  times  the  number  of  runs.  Multiplication  by  N  again  has  the 
effect  of  scaling  for  the  number  of  runs. 

5.  MAIN-EFFECT  PARAMETE&IZATIONS 

In  this  section  only  main-effect  paramo ter iaat ions  will  be  considered,  but 
the  numoer  of  factors  and  the  numbers  of  levels  of  each  factor  are  arbitrary. 
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Let  2  be  the  coefficient  matrix  of  the  full  factorial  design  consisting  of 
p  factors  and  let  M  be  the  number  of  runs  in  this  design.  Due  to  the 
particular  choice  of  scaling  used  in  defining  the  parameters,  the  following 
theorem  is  true. 


THEOREM  1.  The  squared  length  of  every  column  vector  of  Z  is  M  and 
the  squared  length  of  every  row  vector  is  p. 


PROOF.  Consider  an  arbitrary  factor  at  any  arbitrary  number  s  of  levels. 

With  this  factor  are  associated  s  -  1  columns  of  Z,  the  coefficient  vectors 

for  the  main  effects  of  this  factor.  The  components  of  these  coefficient 

vectors  may  be  determined  from  the  definition  of  the  main-effect  parameters. 

^  represent  the  main  effects 

of  the  given  factor.  These  parameters  are  defined  as  linear  combinations  of 

u  ,  ...,  S  Let  Q  =  £q.  . }  be  the  matrix  of  the  coefficients  of  these 

O  S— 1  1J 

linear  combinations,  so  that  we  have 


Let  represent  the  grand  mean  and  {5^, 


•  • . , 
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**—  — 
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_Ps-l_ 
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l/s  .  . 

.  l/s 
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•  q2s 
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lsl 


ls2 


•  •  • 


lss 


^1 


u, 


s-1 


which  may  be  written  in  matrix  notation  as  3*  =  Qu*  (by  making  the  obvious 
definitions).  Note  that  the  elements  of  the  firs+  row  of  the  matrix  Q  are 
all  l/s,  and  that  this  is  consistent  with  the  definition  of  the  grand  mean 
given  in  the  introduction.  By  definition  the  rows  of  Q  are  orthogonal  and 
have  squared  length  l/s. 


Consider  the  matrix  P  =  Q~\  It  can  be  verified  that  due  to  the  orthogo¬ 
nality  of  the  columns  of  Q,  P  is  as  follows 
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p  = 


1 

1 


8q 


21 


f  •  • 


aq 


S*2s  *  * 


aq 


si 


22  •  •  • 


aq 


s2 


aq 


as 


or  P  a  sQ*.  Now  PP'  =  P'P  =  si,  so  that  the  sum  of  squares  of  any  row 
or  column  of  P  is  a. 


Consider  now  the  subnatrix  consisting  of  the  columns  of  Z  associated 
with  the  main  effects  of  the  factor.  Any  row  of  this  submatrix  h^s  the 
same  elements  as  a  row  of  P  with  the  first  element  omitted.  Since  the 
design  is  the  full  factorial,  each  level  of  the  factor  appears  the  same 
number  of  times,  and  hence  each  row  of  P  is  repeated  the  same  number  of 
times  k.  The  constant  k  is  the  product  of  the  number  of  levels  of  the 
remaining  factors.  The  sum  of  squares  of  each  column  of  the  submatrix  is 
therefore  k  x  s,  which  equals  M,  The  sum  of  squares  of  each  row  of  the 
submatrix  is  obviously  s  -  1. 

Finally,  consider  the  entire  matrix  Z,  The  squared  length  of  columns 
associated  with  main  effects  has  been  shown  to  be  M  and  the  column  associ¬ 
ated  with  the  grand  mean  has  1  for  all  its  components,  hence  also  has  squared 
length  M.  It  has  also  been  shown  that  the  squared  length  of  the  part  of  a 

row  associated  with  the  i-th  factor  is  s^-l,  so  that  the  total  squared 
n 

length  is  1  +  T  (s.-l),  which  equals  p,  the  number  of  parameters.  The 
i=l  1 

proof  is  now  complete. 


Using  Theorem  1  and  the  facts  that  the  trace  and  the  determinant  of  a 
matrix  are,  respectively,  the  sum  and  the  product  of  its  eigenvalues,  we 
can  prove  the  following  theorem. 
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THEOREM  2.  Both  criteria  [(a)  and  (b)]  are  satisfied  if  and  only  if  a 
design  is  orthogonal  tinder  the  canonical  main-effect  parameterization,  in 
which  case  they  will  also  be  satisfied  under  any  other  main-effect  parameter¬ 
ization,  and  the  variances  of  all  predicted  values  are  equal. 

PROOF*  Consider  first  the  average  variance  of  a  predicted  value  times 
the  number  of  runs*  This  is  given  by 

(o^n/m)  trace  ZCX'X)-^’  =  o2N  trace  (X'X)"1  =  c2N  ?  l/X, ,  where  the  JL 

i»l  1 

are  the  eigenvalues  of  X'X.  Since  by  Theorem  1  the  sums  of  squares  of 
elements  of  the  rows  of  Z  are  all  p,  the  same  is  true  of  X,  and  we  have 
traoe  XX'  *  trace  X'X  «  Np,  so  that  £X^  =  trace  X'X  is  fixed.  In  order  to 
minimize  NH/X^  with  £X^  fixed,  the  X^  must  all  be  equal  to  N,  which 
implies  that  X'X  is  a  scalar  multiple  of  the  identity  matrix.  In  this  case 

the  variances  of  all  predicted  values  (the  diagonal  elements  of  a  Z(x,X)~xZ' ) 

2  2 
are  equal  to  o  p/N,  and  N  times  the  average  is  o  p/M.  The  chosen 

determinant  criterion  is  NPdet(X'X)”\  To  minimize  det  (X'X)"”*  with  TX^ 

fixed,  the  X^  must  again  all  be  equal,  and  the  value  c*  the  criterion  is 

unity. 

Now  consider  any  nonsingular  reparameterization  &  =  The  expression 

for  the  first  criterion  takes  the  form  (ct2N/m)  trace  ZA(A'X'XA)  ^'Z',  so 
that  the  A's  cancel  and  the  value  is  the  same  as  before.  The  second  criterion 

p  12 

becomes  Npdet(X'X) “V(detA)  ,  so  that  under  a  new  parameterization  both 
criteria  are  minimized  if  and  only  if  the  design  is  orthogonal  under  the 
canonical  parameterization.  This  completes  the  proof. 
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4 .  HIGHER-ORDER  PARAMETERIZATIONS 


In  the  proof  of  Theorem  2  the  only  reason  for  restricting  the  parameter¬ 
ization  to  contain  only  main  effects  is  so  that  Theorem  1  will  apply.  There¬ 
fore,  Theorem  2  applies  to  any  canonical  parameterization  for  which  Theorem  1 
holds#  It  will  now  be  demonstrated  that  Theorem  1  is  valid  for  many  canonical 
parameterizations  involving  interactions# 

Let  Z  be  the  coefficient  matrix  of  the  full  factorial  under  a  canonical 
parameterization  including  the  grand  mean,  main  effects,  and  two- factor  inter¬ 
actions#  Consider  those  columns  corresponding  to  the  main  effects  and  the 
two-factor  interactions  involving  factors  at  s^  and  s 2  levels.  Treating 
these  factors  and  their  interactions  as  a  single  s^s^-level  factor,  we  find 
that  Theorem  1  applies  and  that  the  squared  lengths  of  these  columns  are  all 
equal  to  M  and  that  the  squared  length  of  the  rows  is  8^2-1.  We  know  from 
Theorem  1  that  the  portion  of  this  squared  length  attributable  to  the  columns 
for  the  main  effects  of  the  two  factors  is  (s^-l)  +  (s2“l).  The  remainder  is 
(s^-lHsg-l),  which  is  equal  to  the  number  of  interaction  narameters.  This 
argument  may  be  repeated  for  the  interactions  between  any  pair  of  factors,  or 
for  that  matter  for  interactions  involving  any  number  of  factors.  Therefore, 
the  conclusions  of  Theorem  1  are  valid  whenever  parameters  appear  onlj  in 
complete  sets,  which  are  sets  such  that  if  any  interaction  parameters  between 
a  group  of  factors  are  included,  then  all  possible  such  parameters  are  included. 
Theorem  2  may  therefore  be  generalized  as  follows. 

THEOREM  3.  Under  any  canonical  parameterization  involving  only  complete 
sets,  or  a  nonsingular  transformation  of  such  a  parameterization,  the  deter¬ 
minant  and  average  variance  criteria  are  satisfied  if  and  only  if  the  design 
is  orthogonal  under  the  canonical  parameterization. 
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It  is  of  considerable  interest  to  determine  which  designs  satisfy  the 
two  criteria  if  the  parameterization  does  not  consist  only  of  complete  sets* 
The  simplest  such  situation  involves  two  factors,  one  at  two  and  the  other 
at  three  levels.  The  main  effects  for  the  three-level  factor  are  the  linear 
effect  and  the  quadratic  effect,  as  in  the  example  discussed  earlier.  Simi¬ 
larly,  the  interaction  between  the  two  factors  may  be  parameterized  using 
linear  and  quadratic  components.  Of  the  two,  suppose  only  the  quadratic 
component  is  included  in  the  model.  Then  the  matrix  Z  is  as  follows: 
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-i/yr 
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-2/J2 

2/J2 
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treatment  combinations 

00,  01, 

12,  respectively.  Consider  designs  for  which  the  number  of  runs  at  treatment 
combinations  00,  02,  10,  and  12  is  a  constant  X  and  the  number  of  runs  at 
combinations  01  and  11  is  ^(N-4X).  For  such  designs  the  cross-product  matrix 
is  of  the  form 


X  X  = 
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We  seek  the  value  of  x/N  which  will  minimize  N5/det(X’X)  =  N5/3523X5(N-4X)2. 
This  value  is  x/N  =  3/20,  for  which  the  criterion  has  approximately  the  value 
.95  (the  value  for  the  orthogonal  design  being  1.0).  The  trace  of  the  inverse 
times  N  is  (3N  -SXiO/dXCN^X) ,  for  which  the  minimizing  value  is  x/N  = 
(I)-/})/®.  The  criterion  has  the  value  4.98,  again  slightly  better  than  the 
value  of  5.00  obtained  with  the  orthogonal  design. 
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APPENDIX  D 


DESIGNS  FOR  STUDYING  ONE  FACTOR  AT  A  TIME 

INTRODUCTION 

It  ha*  become  axiomatic  in  the  statistical  experimental  design  literature 
to  discourage  the  practice  of  varying  one  factor  at  a  time.  In  the  case 
of  factors  each  at  tiro  levels,  the  support  for  this  point  of  view  is  that 
the  variances  of  the  main-effect  estimates  using  such  designs  are  con¬ 
siderably  larger  than  with  orthogonal  designs. 

On  the  other  hand,  the  experimenter  often  likes  such  designs  because  he 
finds  out  more  rapidly  whether  a  new  factor  has  any  effect.  He  con¬ 
tinually  receives  information  rather  than  having  to  wait  till  the  entire 
experiment  is  completed.  If  the  magnitudes  of  the  effects  he  is  inter¬ 
ested  in  are  several  times  as  large  as  experimental  error,  if  he  does 
not  need  to  describe  these  affects  precisely,  and  if  there  are  no  inters¬ 
ections,  there  is  no  particular  disadvantage  in  experimenting  in  this  way. 

Another  advantage  of  one-at-a-time  experiments  is  that  they  are  contrac¬ 
tible  or  expansible  without  limit.  Thus,  no  matter  how  many  tests  have 
been  run,  the  experiment  may  be  stopped,  in  which  case  estimates  of  the 
effects  of  those  factors  which  have  already  been  varied  may  be  obtained, 
or  it  may  be  continued  by  the  introduction  of  a  new  two-level  faotor. 
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An  excellent  exposition  of  the  statistical  arguments  against  such  designs 
was  given  by  Fisher  in  The  Design  of  Experiments,  sections  37  and  38  [2], 
He  bases  his  attack  on  their  low  efficiency  compared  with  orthogonal 
designs  and  their  lack  of  information  about  interactions. 

Cuthbert  Daniel  [l]  has  presented  the  positive  aspects  of  these  designs. 
He  pointed  out  that  they  are  expansible  and  contractible  and  that  they 
provide  a  quick  look  at  each  factor.  He  went  on  to  say  that  they  can 
often  be  augmented  to  form  a  half  replicate  plus  one  additional  run,  in 
which  case  the  lost  efficiency  is  for  the  most  part  regained. 

It  is  the  purpose  of  this  paper  to  investigate  the  properties  of  one-at- 
a- time  designs.  It  is  pointed  out  that  there  are  many  classes  of  such 
designs.  A  lower  bound  is  obtained  for  the  variances  of  the  estimates 
using  any  such  design,  and  several  classes  are  given  for  which  the  lower 
bound  is  obtained.  The  assumptions  are  that  each  factor  has  only  two 
levels,  that  no  interactions  are  present,  and  that  there  is  no  effect  of 
the  order  of  observations  on  the  response.  The  last  assumption  is  neces¬ 
sary  because  the  order  of  the  runs  cannot  be  randomized.  The  number  of 
factors  need  not  be  specified  in  advance • 


BOUNDS  FOR  THE  VARIANCES  OF  THE  EFFECTS 


We  consider  factors  at  two  levels.  For  convenience,  the  initial  level 
of  each  factor  will  be  considered  the  low  level  and  will  be  denoted  by 
0  or  The  first  run  will  have  each  factor  at  its  low  level.  If 

there  are  n  factors  the  design  contains  n  ■*>  1  runs*  The  i-th  factor 
appears  for  the  first  time  at  its  high  level  in  run  i  t  1,  After  it  has 
been  introduced  it  may  stay  at  its  high  level,  revert  to  its  low  level, 
or  be  varied  between  its  two  levels  on  subsequent  tests*  Thus  there  is 
a  wide  latitude  of  possible  one-at-a-time  designs* 

Experiments  for  estimating  the  main  effects  of  two-level  factors  are  con¬ 
ventionally  analyzed  in  terms  of  the  coefficient  matrix  X  as  follows. 

The  first  column  of  X  corresponds  to  the  grand  mean  and  has  all  its 
components  equal  to  1.  Each  of  the  remaining  columns  corresponds  to  one 
of  the  factors,  and  each  row  corresponds  to  a  run*  According  to  whether 
a  given  factor  is  at  its  high  or  low  level  in  a  given  run,  the  correspond¬ 
ing  element  of  X  contains  the  entry  +1  or  -1* 

Suppose  a  vector  7  of  N  responses  is  obtained  from  the  experiment* 
Under  the  assumption  that  there  are  no  interactions  we  may  write 
Y  *  Xp  ♦  e,  where  3  i®  the  vector  of  the  unknown  parameters  and  e 
is  a  vector  of  independent  random  errors  having  mean  zero  and  common 
variance  a2*  The  least-squares  estimate  0  of  0  is  0  =  (X'X)“’1X,Y. 


2 

The  covariance  matrix  of  $  is  a  (X'X)~  . 

In  addition  to  working  with  the  traditional  coefficient  matrix  X,  it 
will  be  convenient  to  introduce  a  reduced  matrix  R.  Where  X  has  an 
element  1,  R  also  has  1;  where  X  has  a  -1,  R  has  a  zero.  It 
may  be  verified  that  X  and  R  are  related  through  the  triangular 
transformation  matrix  T  according  to  the  equation  X  *  RT,  as  in  the 
following  example. 
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0 

0 

0 

£ 

In  general,  t^  is  given  by  the  following  rules s 

hi  m  1 

t^j  *  —1  (j»2,  •«*,  n+l) 
tjj  *  2  (j»2,  n+l) 

t^  «  0  (otherwise). 

It  may  be  verified  that 
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X 


I.  (2) 


i  H  H  i 
0  0  0  0  0 
0  0  £  0  0  0 
0  0  0  i  0  0 
0  0  0  0  £  0 
0  0  0  0  0  £ 


If  X  is  the  coefficient  matrix  for  any  one-at-a-time  design,  then  the 
corresponding  R  is  lower  triangular,  and  has  l*s  down  the  main  diag¬ 
onal,  Therefore  the  determinant  of  R  is  unity.  Since  the  elements  of 
R  are  integers  any  minor  is  integral.  Since  the  elements  of  an  inverse 
are  by  definition  an  appropriate  minor  divided  by  the  determinant  of  the 
original  matrix,  R~^  also  consists  of  integers.  It  follows  from  (2) 
that  each  element  of  X-*1  *  T~^R”^  must  be  a  multiple  of  £. 


THEOREM  10  For  a  one-at-a-time  design  containing  n  factors  at  two 
levels  and  n  +  1  runs,  a  lower  bound  for  the  variance  of  any  estimate 
is  £ tr2  • 

2 

PROOF,  The  variances  of  the  estimates  are  a  times  the  diagonal  ele¬ 
ments  of  (X'X)*’^,  Because  X  is  square,  (X'X)“*  reduces  to  (x“*)(x“*)  , 
The  diagonal  elements  are  therefore  the  sums  of  squares  of  the  elements 
in  each  row  of  X~\  We  know  already  that  the  elements  of  X-**  are  all 
multiples  of  £.  The  sum  of  squares  of  the  elements  in  a  row  must  therefore 
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be  a  positive  multiple  of  •$•*  If  the  value  were  -J-,  then  all  the  ele¬ 
ments  would  be  zero  except  one  which  was  equal  to  +■$>  •  The  inner  product 
of  a  row  of  X~*  and  a  column  of  X  must  of  course  be  either  0  or  1. 
Since  the  elements  of  X  are  all  either  <*>1  or  -1,  the  inner  product 
of  any  column  of  X  with  a  row  containing  a  single  would  be  +jr* 
Therefore  a  lower  bound  to  the  sum  of  squares  of  elements  of  any  row  of 
X~^  is  and  the  theorem  is  proved* 
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CONSTRUCTION  AND  COMPARISON  OP  VARIOUS 
ONE-AT-A-TIME  DESIGNS 


The  most  familiar  family  of  one-at-a-time  designs  are  those  in  which  each 


factor  returns  to  its  low  level  after  it  has  been  "studied1*.  The  general 

-1  -1 

form  of  the  matrices  R,  R  ,  and  X  in  this  family  are  exemplified 
by  the  following  five-factor  case: 


1  0  0  0  0  0 

10  0  0 

1  1  0  0  0  0 

-110  0 

10  10  0  0 

10  0  10  0 

5  K"1  = 

-10  10 

-10  0  1 

1  0  0  0  1  0 

-10  0  0 

1  0  0  0  0  1 

-10  0  0 

It  will  be  noted  that  the  variance 


0  0 

-x*  it  tit 

0  0 

-i  ioooo 

0  0 

-1 

-i  0  £  0  0  0 

0  0 

5  X  A  * 

*4  0  0  7  0  0 

1  0 

«4  oooio 

0  1 
— 

4-  0  0  0  0  i 

2 

each  main-effect  estimates  is  , 


the  theoretical  lover  bound.  The  variance  of  the  grand  mean  is 
2  2 

o  (n  -3n-»4)/4  for  the  n-factor  case. 


* 


It  is  of  interest  to  inquire  whether  or  not  there  is  a  family  of  designs 

in  which  the  variance  of  the  grand  mean  is  also  at  the  minimum  level  of 

a  •  The  family  of  designs  in  which  each  factor  is  maintained  at  its 

high  level  satisfies  this  requirement.  Again  using  a  five-factor  example 

-1  -1 

to  illustrate  the  general  case,  R,  R  ,  and  X  are  as  follows: 


1J8 


1  0  0  0  0  0 
1  1  0  0  0  0 
1110  0  0 
11110  0 
111110 
111111 


1  0  0  0  0  0 

-1  1  0  0  0  0 

0-11000 
0  0-1100 
0  0  0  -1  1  0 

0  0  0  0  -1  1 


i  o  o  o  o  i 

4  1  0  0  0  0 

o  -4  -$■  o  o  o 

o  o  -4  £  o  o 

o  o  o  -4  |  o 

o  o  o  o  -J-  * 


These  two  classes  of  designs  are  two  extremes  in  which  all  factors  are 
either  returned  to  their  initial  level  or  are  held  at  their  new  level. 
Ordinarily  an  experimenter  would  like  to  determine  which  level  of  each 
factor  is  better,  and  conduct  the  remaining  experiments  at  the  more  de¬ 
sirable  level.  Thus,  in  general  some  factors  will  be  held  at  their  new 
level  and  some  will  be  returned  to  their  old  level.  In  a  sense,  the 
experiment  will  be  midway  between  the  extremes  already  investigated.  For 
these  extremes,  the  variances  of  the  main  effeots  attain  the  lover  bound 
of  -£-c  •  The  fact  that  the  grand  mean  has  a  higher  variance  with  the 
first  family  considered  will  often  make  little  difference  in  the  type  of 
experiment  in  which  one-at-a-time  designs  are  often  used.  It  seems 
appropriate  to  ask  whether  or  not  the  main-effect  variances  will  always 
be  if  factors  are  maintained  at  their  more  desirable  level  for  the 

duration  of  the  experiment.  The  answer  is  yes  as  can  be  seen  from  the 
following  theorem. 
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THEOREM  2.  For  any  one-at-a-time  design  in  which  each  factor  is  main¬ 
tained  at  either  the  high  or  low  level  after  the  initial  introduction  of 

p 

the  high  level*  the  variances  of  the  main-effect  estimates  are  all  -jra* • 

PROOF,,  Except  for  the  first  row,  each  entry  in  the  matrix  is  -jr 

-1 

times  the  corresponding  entry  in  R  .  Therefore  the  theorem  will  be 
proved  by  showing  that  each  row  of  R~^  except  the  first  contains 
one  +1,  one  -1,  and  the  remaining  entries  are  0.  The  proof  will  be 
inductive. 

The  theorem  is  obviously  true  for  the  case  of  one  factor.  Two  possibili¬ 
ties  exist  for  n  *  2: 


R^  * 

ioo" 

110 

10  0 

-110 

;  ,<*>. 

10  0 

110 

,  R^'1 

1 

10  0 

-110 

111 

1-4 

1 

o 

_ l 

10  1 

-10  1 

-1 

In  either  case  R  has  the  required  form. 

Assume  the  theorem  is  true  for  designs  involving  n  or  fewer  factors. 

The  reduced  design  matrix  Rg  for  a  design  involving  n  +  1  factors  may 
be  partitioned  as  follows s 


— 

■  * 

8o 

l  Z  |  2 

-  —  - 

| - 1  —  - 

r 

1  1  i  0 

—  —  — 

j  —  —  1  —  — 

r 

I  q  1  1 
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Here  R  is  the  n  x  n  reduced  design  matrix  for  a  design  of  the  re- 
o 

quired  type  involving  n  -  1  factors,  and  z  is  an  n-rowponent  vector 
of  zeros.  The  vector  r  consists  of  zeros  and  ones,  and  is  repeated  in 
the  last  two  rows  because  of  the  restriction  that  a  factor  is  held  at 
either  its  high  or  low  level  after  it  has  initially  been  varied.  The 
single  entry  q  will  either  be  zero  or  one  depending  on  whether  the 
n-th  factor  is  to  be  maintained  at  its  high  or  low  level  for  the  duration 
of  the  experiment.  The  inverse  of  R2  nay  he  partitioned  in  tho  same 
ways 


1 ' 0 
-I- 
c  i  1 


Since  R2  is  a  lower  triangular  matrix  with  R^  as  a  submatrix, 
is  also  lower  triangular  with  R^  (and  hence  also  R~  )  as  a  corres¬ 
ponding  submatrix.  Thus,  the  first  n  rows  of  R^  are  as  indicated 
above,  where  z  again  indicates  an  n-component  zero  vector.  By  the 
induction  hypothesis,  the  n-component  vector  a  has  all  its  entries 
zero  except  one,  which  is  equal  to  -1.  Suppose  the  component  q  in 
R2  is  equal  to  1.  Then  in  order  for  the  inner  products  of  the  last 
row  of  R2  and  the  first  n  +  1  columns  of  R^  all  to  have  the  value 
zero,  the  vector  b  must  be  an  n-component  zero  (row)  vector  and  o 
must  have  the  value  -1.  How  suppose  q  is  equal  to  zero.  In  order 


col- 


for  the  inner  product  of  the  last  rov  of  R2  an<*  the  n  +  1 

umns  of  ^2  all  to  have  the  value  zero,  b  must  equal  a  and  c  must 

equal  zero.  In  either  case,  the  last  row  of  R~^  contains  one  1,  one 

-1,  and  n  0's.  The  theorem  now  follows  by  induction. 

This  theorem  characterises  the  "optimum  class"  of  one-at-a-time  designs. 

We  conclude  by  giving  an  example  of  a  very  bad  one-at-a-time  design  which 
is  not  a  member  of  this  class.  Suppose  instead  of  returning  each  factor 
to  its  initial  low  level  immediately,  it  is  kept  at  the  high  level  for 
one  additional  run.  Intuitively  it  would  seem  that  the  latter  series  of 
designs  would  differ  little  from  the  former,  except  that  the  high  level 
is  "studied"  a  little  further.  In  fact  this  minor  modification  seriously 
degrades  the  quality  of  the  designs.  A  five-factor  example  of  this  series 
is  as  follows: 


1  0  0  0  0  0 

1  1  0  0  0  0 

1110  0  0 

-1 

~i  0  0  0  0  0 
-110000 

0-11000 

-1 

-4  4  0  4  0  4 
-4  4  0  0  0  0 
o-4|ooo 

101100 

;  R  = 

-11-1100 

;  x  « 

—4  4  -4  4oo 

10  0  110 

l  0  0  0  1  1 

0-1  1-1  1  0 

_-i  1-1  1-1  1 

o-4  4  *4  4  0 
j4  4-4  4-4  £ 

2 

In  general  the  first  two  main-effect  estimates  will  have  variance  4a  • 
the  next  two  n  ,  the  next  two  l4a ,  etc.  This  series  of  design  illus¬ 
trates  the  fact  that  apparently  inconsequential  modifications  of  designs 
can  in  fact  have  a  serious  effect  on  the  variances  of  the  estimates. 


IkZ 
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APPENDIX  E 


SOME  NEW  INCOMPLETE  FACTORIAL  DESIGNS 

SUMMARY 

k 

An  exhaustive  search  of  possible  twelve -run  2  designs  has  been  made. 
There  are  nineteen  essentially  different  design  configurations,  fifteen 
of  which  permit  the  estimation  of  the  mean,  main  effects,  and  tvo- 
factor  Interactions.  Two  have  been  previously  discussed  In  the  litera¬ 
ture.  Two  new  designs  seem  to  be  of  practical  value.  One  has  variances 
of  (15/128) a2  for  main  effects  and  (l/8)a2  for  two-factor  interac¬ 
tions;  the  other  has  variances  of  (7/64)  o2  for  the  main  effects,  but 
the  interactions  have  scesewhat  higher  variances . 

An  incomplete  2 1  design  with  29  runs  has  been  found  in  which  all  main 
effects  and  interactions  can  be  estimated  with  variance  .050a2 .  A 
modification  with  56  runs  allows  estimates  of  the  main  effects  with 
variance  •039(j2  and  of  two-factor  interactions  with  variance  .048a2 . 


INTRODUCTION 


This  paper  was 
Meeting  of  the 
Illinois .  The 
the  author  was 


originally  given  in  November  of  1961  at  a  Central  Regional 
Institute  of  Mathematical  Statistics  held  in  Urbana, 
research  work  was  done  at  the  University  of  Chicago  while 
a  graduate  student  in  the  Department  of  Statistics.  The 
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paper  describes  two  loosely  associated  Investigations  on  2n  incomplete 
factorial  designs  which  were  not,  however,  incorporated  into  the 
author's  thesis.  They  are  of  some  general  interest,  and  have  been 
referred  to  in  a  general  paper  on  non-orthogonal  designs  [2]  by  the 
author.  Since  the  latter  paper  has  been  submitted  for  publication,  and 
with  the  anticipation  that  its  publication  will  create  some  interest  in 
these  results,  this  paper  has  been  prepared  at  this  time . 


FOUR-FACTOR  TWELVE-RUN  IESIGHS 

In  the  case  of  four  two-level  factors,  there  are  four  main  effects,  six 
two-factor  Interactions,  four  three -factor  interactions,  and  one  four- 
factor  interaction.  If  interactions  involving  more  than  two  factors 
are  negligible,  there  are,  including  the  mean,  eleven  parameters  to  be 
estimated.  Two  twelve-run  designs  have  been  proposed  by  Peter  John  [l] 
to  handle  such  situations,  although  he  has  given  no  indication  as  to 
whether  or  not  his  designs  are  in  any  sense  optlmua.  In  an  attempt  to 
determine  the  relative  merits  of  his  designs,  I  have  made  an  exhaustive 
classification  of  all  possible  four- factor  designs  containing  runs  at 
twelve  distinct  treatment  combinations .  It  is  most  convenient  to  per¬ 
form  this  classification  in  terms  of  the  four  treatment  combinations 
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which  are  omitted. 


As  usual.  A,  B,  C,  and  D  denote  the  factors.  The  treatment 
combination  with  every  factor  at  its  low  level  will  be  denoted  by  (l), 
and  every  other  treatment  combination  by  those  (lover  case)  letters 
corresponding  to  the  factors  at  their  high  levels.  The  treatment  com¬ 
binations  form  a  group  whose  identity  is  (l)  .  The  product  of  two 
treatment  combinations  is  the  treatment  combination  containing  all  the 
letters  from  the  original  two  except  any  they  have  in  common.  Thus, 
for  example,  a  x  b  ■  ab,  abc  x  cd  «  abd,  and  ab  X  ab  «  (l)  . 

There  are  Qfy  *  1320  possible  ways  of  choosing  four  treatment  com¬ 
binations  to  be  omitted,  but  many  of  these  choices  lead  to  equivalent 
designs .  Two  designs  vill  be  considered  equivalent  if  one  can  be 
obtained  from  the  other  by  a  combination  of  one  or  both  of  the  following 
tvo  operations:  l)  rearranging  the  factors,  and  2)  reversing,  for  any 
factors,  the  high  and  the  low  levels.  It  follows  that  we  may,  without 
loss  of  generality,  restrict  attention  to  only  those  quadruples  which 
contain  any  given  fixed  treatment  combination;  the  treatment  combination 
(l)  will  be  so  chosen.  The  39  distinctive  sets  of  four  treatment  com¬ 
binations  given  in  Table  I  all  contain  the  identity  (l)  and  are  such 
that  one  cannot  be  obtained  from  another  by  rearranging  the  factors. 
Thus,  given  any  set  of  four  treatment  combinations  containing  (l),  by 
relabeling  the  variables  one  of  the  sets  in  Table  I  will  be  obtained. 

For  example,  [(l),  c,  abd,  ac]  becomes  C(l),  a,  ah,  bed]  which  is 
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TABLE  I 


DISTINCTIVE  FOUR-POINT  CONFIGURATIONS 


b,  c 

(l),a,b,ab 

(l),a,b,ac 

(l),a,b,cd 

(i),a,b,abc 

(l),a,b,acd 

(l),a,b,abcd 

(l),a,ab,ac  -  [(l),a,b,c] 
(l),a,ab,bc  *  [(l),a,b,abc] 
(l),a,ab,cd  -  [(l),a,b,acd] 
(l),a,bc,bd 

(l),a,ab,abc  *  [(l),a,b,ac] 

(l),a,ab,acd  *  [(l),a,b,cd] 

(l),a,ab,bcd  »  [(l),a,b,abcd] 

(l),a,bc,abc 

(l),  a,  be,  abd 

(l),  a,  be,  bed 

(l),a,ab,abcd  »  [(l),a,b,acd] 
(l),  a,  be,  abed 

(l),a,abc, abd  ■  [(l),a,bc,bd] 


(l),a,abc,bcd  -  [(l), a, be, abed] 

(l),a,abc,abcd  «  [(l),a,bc,bcd] 

(l),  a,  bed,  abed 

(l),ab,  ac,ad 

(l),ab,ac,bc 

(l),ab,  ac,bd 

(l),ab, ac,abc  «  C(l),a,b,iibc] 
(l),ab,ac,abd  =  [(l),a,bc,bd] 
(l),ab,ac,bcd 

(l),ab, cd,abc  «  [(l), a, be, abed] 
(l),ab,ac,abcd  »  C(l),ab,ac,bd] 
(l),ab,  cd, abed 

(l),ab,abc,abd  »  [(l),a,b,cd] 
(l),ab,abc,acd  »  [(l), a, be, abd] 
(l),ab,acd,bcd 

(l),ab,abc,abcd  »  C(l),a,b,acd] 
(l),ab,acd,abcd  «  [(l)  ,a,bc,abcd] 
(l),abc,abd,acd  ■*  C(l),ab,ac,bcd] 
(l),abc, abd, abed  «  [(l),a,b,abcd] 
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the  14th  entry.  Tbe  process  of  interchanging  the  levels  of  one  or  more 
factors  Is  equivalent  to  multiplying  each  combination  in  the  set  of  four 
by  another  treatment  combination,  namely  that  containing  the  letters  for 
the  factors  whose  levels  are  changed.  Since  by  convention  each  set  is 
required  to  contain  the  Identity  (l),  all  possible  equivalent  sets  can 
be  obtained  by  multiplying  the  given  set  by  each  of  its  elements  in  turn. 
Thus,  starting  vith  the  set  [(l),a,b,acd],  three  equivalent  sets  are 
obtained  as  follows: 


TABLE  II 


Multiplier 

Derived  Set 

Standard  Form 

a) 

(l),a,b,acd 

(l),a,b,  acd 

a 

a,  (l),ab,cd 

(l),a,ab,  cd 

b 

b,ab,  (l),abcd 

(l),a,ab,abcd 

acd 

acd, cd,  abed, (l) 

(l)  ,ab,abc,abcd 

The  final  column,  labeled  "Standard  Form",  is  obtained  from  the  second 
by  renaming  and  rearranging  the  factors  so  that  the  sets  are  in  the  same 
form  as  in  Table  I.  In  Table  I  those  sets  which  are  equivalent  to  sets 
preceding  them  in  the  table  have  the  equivalent  set  given  after  them  in 
square  brackets. 


By  exmiratioo.  of  Table  I  it  can  be  seen  that  there  are  exactly  nineteen 
equivalence  classes  to  which  the  possible  twelve-run  designs  belong. 

For  those  designs  which  are  noosinguLar,  the  variances  of  the  estimates 
using  these  designs  are  given  in  Table  in*  Each  variance  is  actually 
a  multiple  of  a2,  and  only  the  coefficients  of  a2  are  given  in  the 
table .  Those  designs  from  which  it  is  Impossible  to  estimate  all 
eleven  parameters  have  the  notation  "singular"  entered  in  the  table. 

The  designs  proposed  by  John  are  numbers  15  and  19  in  Table  HI.  Both 
have  the  property  that  the  four  treatment  combinations  omitted  are  sub¬ 
groups,  so  that  these  designs  are  fractional  factorials.  The  only  other 
designs  with  this  property  are  the  four  singular  designs.  Both  of  John's 
designs  have  the  variance  of  each  estimate  equal  to  o2 /8  except  for 
two  estimates  which  have  variance  3o2/32 .  These  are  the  grand  mean 
and  a  min  effect  for  15  and  the  grand  mean  and  an  interaction  for  19* 
Since  main  effects  are  ordinarily  of  more  interest,  15  would  often  be 
preferable  to  19,  as  John  points  out. 

Three  other  designs  appear  to  have  merit;  these  are  numbers  lk,  16,  and 
17  •  The  average  variance  of  the  estimates  is  as  low  as  possible  for 
designs  Ik,  15,  17,  and  19.  Design  number  lk  is  for  most  purposes  the 
best  twelve-run  design  available,  since  it  has  the  added  advantage  that 
each  main  effect  is  estimated  with  a  variance  which  is  below  this 
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TABLE  III 


VARIANCES  OF  ESTIMATES  FOR  12 -RUN  2k  DESIGNS 
Humber  Combinations  Variances  (Times  128) 


Omitted 

Mean 

A 

B 

C 

D 

AB 

AC 

AD 

BC 

BD 

CD 

Det  X'X 

1 

(l)  c 

48 

28 

28 

28 

48 

16 

16 

28 

16 

28 

28 

•34X1011 

2 

(l),a,b,ab 

singular 

0 

3 

(l),a,b,ac 

32 

28 

28 

28 

32 

16 

16 

28 

16 

28 

28 

•34X1011 

4 

(l),a,b,  cd 

32 

16 

16 

60 

60 

16 

28 

28 

28 

28 

32 

•34X1011 

5 

(l),a,b,abc 

32 

28 

28 

60 

32 

16 

16 

28 

16 

28 

60 

•34X1Q11 

6 

(l),a,b,  acd 

32 

16 

16 

28 

28 

16 

28 

28 

20 

28 

32 

.34X1011 

7 

(l),a,b,abcd 

48 

16 

16 

28 

28 

16 

28 

28 

28 

28 

48 

.34X1011 

8 

(l),a,bc,bd 

16 

14 

32 

24 

24 

14 

14 

14 

24 

32 

24 

.69KL011 

9 

(l),a,bc,abc 

singular 

0 

10 

(l),a,bc,abd 

16 

14 

32 

24 

24 

14 

14 

14 

24 

24 

32 

.69X1011 

11 

(l),  a,  be,  bed 

32 

16 

28 

28 

16 

23 

28 

16 

32 

28 

28 

•34X1011 

12 

(l),  a,  be,  abed 

32 

16 

28 

28 

16 

28 

28 

16 

32 

60 

60 

.34X1011 

13 

(l),  a,  bed,  abed 

singular 

0 

14 

(l),ab,  ac,ad 

12 

15 

15 

15 

15 

16 

16 

16 

16 

16 

16 

.14X1012 

13 

(l),ab,ac,bc 

12 

16 

16 

16 

12 

16 

16 

16 

16 

16 

16 

.lkXu.012 

16 

(l),ab,  ac,bd 

16 

14 

14 

14 

14 

32 

24 

24 

24 

24 

32 

.69X1Q11 

17 

(l),ab,ac,  bed 

12 

16 

16 

16 

15 

16 

16 

15 

16 

15 

15 

.14X1012 

18 

(l),ab,  cd,abcd 

singular 

0 

19 

(l),ab,  acd,  bed 

12 

16 

16 

16 

16 

16 

16 

16 

16 

16 

12 

.14X1012 
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average.  Design  1 6  would  be  useful  if  the  interactions  were  of  only 
incidental  interest,  since  the  sain  effects  have  the  lowest  average 
variance  with  this  design.  Design  17  would  be  useful  if  one  factor  (d) 
aid.  its  interactions  were  of  the  most  interest. 

It  should  be  noted  that  design  Is-  is  equivalent  to  a  pemutat ion- 
invariant  design.  The  concept  of  permutation- invariance,  which  I  intro  - 
|  duced  in  another  paper  [2],  Implies  that  all  factors  are  treated  alike  in 
the  sense  that  the  cross-product  matrix  remains  unaltered  if  the  factors 
are  permuted.  By  multiplying  each  treatment  combination  by  a  the 

5 

coed) Illations  omitted  are  seen  to  be  a,  b,  c,  and  d,  and  this  set  can 
be  seen  to  be  permutation- invariant .  Design  14  is  the  only  design  of 

i 

the  nineteen  which  is  equivalent  to  a  permutation- invariant  design. 

< 

i 

1  ; 

!  HEW  BEBMUTATIOV-IHVARIABT  RKSGLUTIQl-yiVE  DKSIGHS 

j  t 

The  class  of  fractional  factorials  is  deficient  for  some  experimental 
situations,  one  of  which  is  the  case  of  the  2 1  when  Interactions  are  of 
interest.  Although  only  29  estimates  are  to  be  obtained,  the  smallest 
}  fractional  factorial  is  the  half -replicate,  containing  64  runs .  This 
fact  is  especially  disappointing  when  one  recalls  that  6  factors  can  be 
accommodated  in  the  same  nmsber  of  runs.  Therefore  there  is  considerable 

i 

i 

I 
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interest  in  finding  an  incomplete  factorial  fro*  which  all  the  effects 
and  Interactions  can  be  estimated,  efficiently  and  which  is  considerably 
smaller  than  the  saallest  fractional  factorial  available.  Two  such 
designs,  both  permutation-invariant,  have  been  found  for  this  experi¬ 
mental  situation.  They  were  derived  by  analogy  with  the  permutation- 

k 

invariant  12 -run  2  design  described  above. 

the  first  of  the  new  2}  designs  ve  will  consider  contains  just  29  rurs, 
t  9  smallest  possible  number.  These  runs  are  as  follows:  i)  one  run  at 
the  treatment  combination  (l);  il)  one  run  at  each  of  the  21  two-letter 
treatment  combinations  (i.e.,  ab,  ac,  •••  ,  fg);  and  ill)  one  run  at 
each  of  the  7  six -letter  treatment  combinations  (i.e.,  abedef,  , 
bedefg) .  The  estimate  of  the  grand  mean  is 

-  (1/6)Y(1)  +  (1/24)  +  •••  +  Here  Y(l),  Iab,  etc., 

denote  the  response  at  treatment  combinations  (l),  ab,  etc.  Its 
variance  Is  .076a2.  The  estimate  of  the  main  effect  of  factor  A  is 
A  *  -  (1/2U)Y(1)  +  (1/24)  [Yab+  ...  +  Y^]  -  (1/46)  [Y^*  •••  +  Yfg] 

+  (V2<0  Clrtcdef+  -  *  Ttaodefg]  -  (7/48)  hede{e.  The  estlmtee  of 
the  other  main  effects  are  analogous .  The  variances  of  the  main-effect 
estimates  are  each  .050a2 .  The  estimate  of  AB  is 
AB  -  (1/24)y(1j  +  (7/48)1^  -  (1/24)  [Yac  +  •  •  •  + 

+  (1/W)  ♦  -  +  Yfg]  -  OM)  CYacdefg  +  Ytedefg] 

+  (1/46)  [Yabcdef  +  *••  +  Yabdefg3-  ®ie  oHher  interactions  are 
defined  analogously.  The  variance  of  these  estimates  is  also  .050a2. 
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In  comparing  incomplete  factorial  designs  it  is  useful  to  define  the 
efficiency  of  a  design  for  a  given  estimate  as  the  ratio  of  the  per-run 
information  with  the  given  design  to  the  per-run  information  with  the 
full  factorial,  where  the  information  is  the  reciprocal  of  the  variance. 
For  this  design  the  efficiencies  are  .45  tor  the  grand  mean  and  .68 
for  the  main  effects  and  interactions. 

The  second  of  the  new  2}  designs  contains  l)  one  run  at  (l);  il)  one 

run  at  the  21  two-letter  treatment  combinations;  and  ill)  two  runs  at 

/ 

each  of  the  7  six-letter  combinations .  Thus,  the  second  design  can  be 
obtained  from  the  first  by  duplicating  each  of  the  six-letter  treatment 
combinations.  This  36-run  design  has  an  efficiency  of  .40  for  the 
mean,  .81  for  the  main  effects,  and  .58  for  the  two-factor  inter¬ 
actions.  The  high  efficiency  of  the  main-effect  estimates  and  the  fact 
that  the  design  contains  seven  duplicated  points  from  which  an  estimate 
of  pure  experimental  error  can  be  obtained,  make  this  a  particularly 
attractive  design. 

A  six-factor  design  analogous  to  the  first  seven-factor  design  has  also 
been  investigated.  This  design  contains  22  runs  as  follows:  i)  one  at 
(l);  ii)  one  at  each  two-letter  combination;  and  ill)  one  at  each  five- 
letter  combination.  The  efficiencies  are  quite  high,  namely  .83  for 
the  mean  and  .87  for  the  main  effects  and  interactions* 
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